Switzer, Juliet

From: Switzer, Juliet
Sent: Monday, October 18, 2004 5:22 PM
To: Fredman, Jeffrey
Cc: O'Bryen, Barbara
Subject: search request



Hi Jeff, 



Please approve the search request for more than 10 sequences. Thanks.

PLEASE FORWARD DIRECTLY TO BARB O'BRYEN.

thanks. 
Juliet 



HI 

for 09/701626 

please search the following: 

1. Please do an oligo search of SEQ ID NO 11 in PRIOR ART nucleic acid databases.
Please limit hits to sequences of 100 nucleotides or fewer.

2. Please search SEQ ID NO 84 in GenBank and GenSeq databases, please return all
100% hits.

3. Please search SEQ ID NO 80, 81, 82, 83, 86, 87, 88, 89, 90, 91 in PRIOR ART
databases and return all 100% hits (only 100% necessary).

please return results on disk. 

thanks. 

Juliet Switzer 
Art Unit 1634 
phone: 571-272-753 
office: Remsen 2D75 






Schreiber, David 



From: Switzer, Juliet 

Sent: Tuesday, October 19, 2004 8:38 AM

To: Schreiber, David 

Subject: RE: score/length search 09701626 



issued and published, not the pending database, thanks

-----Original Message-----

From: Schreiber, David 

Sent: Tuesday, October 19, 2004 7:04 AM 

To: Switzer, Juliet

Subject: RE: score/length search 09701626 

When you say US patents you mean only issued, correct? 



David Schreiber, Ph.D. 

Scientific and Technical Information Center 

Biotech/Chem Library 

Remsen E01A61 

571-272-2526 



-----Original Message-----

From: Switzer, Juliet 

Sent: Monday, October 18, 2004 4:55 PM 

To: Schreiber, David 

Subject: FW: score/length search 09701626 



David -- 



Could you please perform a Score/Length sequence search for complements of SEQ
ID NO 11, with a minimum hit length of 10, a maximum hit length of 77, and a
score-over-length cutoff of 70%? Only search US Patents and GenSeq.
Thanks. 

Please return results on disk. Please give me at least 100 hits and alignments. 

Thanks ! 



Juliet Switzer 
Art Unit 1634 
phone: 571-272-753 
office: Remsen 2D75 
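
The Score/Length request above can be read as a post-filter on alignment hits. The following is a minimal Python sketch under stated assumptions: each hit is reduced to a raw score and an aligned length, and "score over length" is taken to mean score divided by hit length. The `Hit` type, its field names, and the helper names are hypothetical and are not part of any GenCore interface.

```python
from dataclasses import dataclass

_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


@dataclass
class Hit:
    hit_id: str    # hypothetical identifier of the database entry
    score: float   # raw alignment score
    length: int    # length of the aligned hit, in nucleotides


def passes_score_length(hit: Hit, min_len: int = 10, max_len: int = 77,
                        cutoff: float = 0.70) -> bool:
    """Apply the hit-length window and the score/length cutoff from the request."""
    if not (min_len <= hit.length <= max_len):
        return False
    return hit.score / hit.length >= cutoff


def filter_hits(hits, **params):
    """Keep only the hits that satisfy the score/length criteria."""
    return [h for h in hits if passes_score_length(h, **params)]
```

Searching for complements would amount to running the same search with `reverse_complement(query)` as the query.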






O'Bryen, Barbara 



From: Switzer, Juliet 

Sent: Tuesday, May 11, 2004 5:19 PM 

To: O'Bryen, Barbara 

Subject: please search 



for 09701626

seq id no 11, 84, and 85 in all nucleic acid databases, please return results on disk,
thanks.



Juliet 



Juliet Switzer 
Art Unit 1634 
phone: 571-272-753
office: Remsen 2D75








GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on:          May 13, 2004, 19:34:52 ; Search time 1269.17 Seconds
                 (without alignments)
                 2629.598 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        3470272 seqs, 21671516995 residues

Total number of hits satisfying chosen parameters:   6940544

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries



Database :       GenEmbl:*
                 1:  gb_ba:*
                 2:  gb_htg:*
                 3:  gb_in:*
                 4:  gb_om:*
                 5:  gb_ov:*
                 6:  gb_pat:*
                 7:  gb_ph:*
                 8:  gb_pl:*
                 9:  gb_pr:*
                 10: gb_ro:*
                 11: gb_sts:*
                 12: gb_sy:*
                 13: gb_un:*
                 14: gb_vi:*
                 15: em_ba:*
                 16: em_fun:*
                 17: em_hum:*
                 18: em_in:*
                 19: em_mu:*
                 20: em_om:*
                 21: em_or:*
                 22: em_ov:*
                 23: em_pat:*
                 24: em_ph:*
                 25: em_pl:*
                 26: em_ro:*
                 27: em_sts:*
                 28: em_un:*
                 29: em_vi:*
                 30: em_[illegible]:*
                 31: em_[illegible]:*
                 32: em_[illegible]:*
                 33: em_[illegible]:*
                 34: em_[illegible]:*
                 35: em_[illegible]:*
                 36: em_htg_mam:*
                 37: em_htg_vrt:*
                 38: em_sy:*
                 39: em_htgo_hum:*
                 40: em_htgo_mus:*
                 41: em_htgo_other:*




Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
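
The run header above names a Smith-Waterman ("sw") model with an identity scoring table and gap penalties Gapop 10.0 and Gapext 1.0. The sketch below shows how a local alignment score with affine gaps can be computed in Python. It is an illustration only, not GenCore's implementation: the mismatch penalty of -1.0 and the convention that the first gapped position costs `gap_open` and each further position costs `gap_ext` are assumptions.

```python
def smith_waterman(query: str, target: str, match: float = 1.0,
                   mismatch: float = -1.0, gap_open: float = 10.0,
                   gap_ext: float = 1.0) -> float:
    """Best local alignment score with affine gaps (Gotoh recurrences)."""
    m = len(target)
    neg_inf = float("-inf")
    h_prev = [0.0] * (m + 1)      # best score ending at (i-1, j)
    f_prev = [neg_inf] * (m + 1)  # best score ending in a vertical gap at (i-1, j)
    best = 0.0
    for q in query:
        h_cur = [0.0] * (m + 1)
        f_cur = [neg_inf] * (m + 1)
        e = neg_inf               # best score ending in a horizontal gap
        for j in range(1, m + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_ext)
            s = match if q == target[j - 1] else mismatch
            # Local alignment: scores never drop below zero.
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best
```

With a match value of 1, a 77-nucleotide query aligned to an identical copy of itself scores 77, which agrees with the "Perfect score: 77" line in the header.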



                              SUMMARIES
                 %
Result          Query
   No.  Score   Match  Length  DB  ID        Description

     1   77     100.0      77   6  BD211287  BD211287 Method of
     2   77     100.0   18042   1  AY038186  AY038186 Pseudomon
     3   75.4    97.9      77   6  BD211300  BD211300 Method of
     4   74.4    96.6      77   6  BD211339  BD211339 Method of
     5   74.4    96.6      77   6  BD211344  BD211344 Method of
     6   74.4    96.6   14143   6  BD211277  BD211277 Method of
     7   74.4    96.6   14143   6  BD211278  BD211278 Method of
     8   74.4    96.6   42143   6  BD211280  BD211280 Method of
     9   72.8    94.5      77   6  BD211395  BD211395 Method of
    10   71.8    93.2      77   6  BD211295  BD211295 Method of
    11   71.8    93.2      77   6  BD211331  BD211331 Method of
    12   71.8    93.2      77   6  BD211398  BD211398 Method of
    13   71.2    92.5      77   6  BD211353  BD211353 Method of
    14   70.6    91.7      77   6  BD211317  BD211317 Method of
    15   69.8    90.6      77   6  BD211354  BD211354 Method of
    16   69.6    90.4      77   6  BD211281  BD211281 Method of
    17   69.6    90.4      77   6  BD211298  BD211298 Method of
    18   69.2    89.9      74   6  BD211374  BD211374 Method of
    19   68.8    89.4    7275   1  AY129393  AY129393 Pseudomon
    20   68.6    89.1      77   6  BD211297  BD211297 Method of
    21   68.6    89.1      77   6  BD211302  BD211302 Method of
    22   68.6    89.1      77   6  BD211316  BD211316 Method of
    23   67      87.0      77   6  BD211341  BD211341 Method of
    24   66      85.7      77   6  BD211324  BD211324 Method of
    25   66      85.7    7300   6  BD211279  BD211279 Method of
    26   65.6    85.2      77   6  BD211351  BD211351 Method of
    27   64      83.1      78   6  BD211325  BD211325 Method of
    28   64      83.1      78   6  BD211327  BD211327 Method of
    29   62.4    81.0      78   6  BD211299  BD211299 Method of
    30   61.8    80.3      78   6  BD211320  BD211320 Method of
    31   61.8    80.3      78   6  BD211405  BD211405 Method of
    32   61.8    80.3      78   6  BD211406  BD211406 Method of
    33   61.4    79.7      78   6  BD211314  BD211314 Method of
    34   61.4    79.7      78   6  BD211323  BD211323 Method of
    35   [illegible in source]
    36   [illegible in source]
    37   [illegible in source]
    38   [illegible in source]
    39   [illegible in source]
    40   56.8    73.8    5957   1  AY129392  AY129392 Pseudomon
    41   56.6    73.5      76   6  BD211288  BD211288 Method of
    42   56      72.7      76   6  BD211290  BD211290 Method of
    43   56      72.7      76   6  BD211348  BD211348 Method of
    44   55.4    71.9      76   6  BD211337  BD211337 Method of
    45   55.4    71.9      76   6  BD211349  BD211349 Method of
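
The "% Query Match" values listed above appear consistent with the hit score divided by the perfect score, expressed as a percentage and rounded to one decimal place. This is an observation from the listed rows, not a documented GenCore formula, and the function name below is hypothetical. A minimal Python sketch:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Percent of the perfect score reached by a hit, rounded to one decimal."""
    return round(100.0 * score / perfect_score, 1)


# Values taken from the summary rows above (perfect score 77).
for score in (77, 75.4, 74.4, 55.4):
    print(score, query_match_percent(score, 77))
```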



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:          May 13, 2004, 18:33:43 ; Search time 484.569 Seconds
                 (without alignments)
                 675.056 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        3373863 seqs, 2124099041 residues

Total number of hits satisfying chosen parameters:   6747726

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       N_Geneseq_29Jan04:*
                 1:  geneseqn1980s:*
                 2:  geneseqn1990s:*
                 3:  geneseqn2000s:*
                 4:  geneseqn2001as:*
                 5:  geneseqn2001bs:*
                 6:  geneseqn2002s:*
                 7:  geneseqn2003as:*
                 8:  geneseqn2003bs:*
                 9:  geneseqn2003cs:*
                 10: geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



                              SUMMARIES
                 %
Result          Query
   No.  Score   Match  Length  DB  ID        Description

     1   77     100.0      77   3  AAZ44900  Aaz44900 P. alcali
     2   75.4    97.9      77   3  AAZ44913  Aaz44913 P. alcali
     3   74.4    96.6      77   3  AAZ44952  Aaz44952 P. alcali
     4   74.4    96.6      77   3  AAZ44957  Aaz44957 P. alcali
     5   74.4    96.6   14143   3  AAZ44981  Aaz44981 P. alcali
     6   72.8    94.5      77   3  AAZ44987  Aaz44987 P. alcali
     7   72.4    94.0      74   3  AAZ88511  Aaz88511 P. alcali
     8   72.4    94.0      74   3  AAZ88515  Aaz88515 P. alcali



     9   71.8    93.2      77   3  AAZ44908    Aaz44908 P. alcali
    10   71.8    93.2      77   3  AAZ44990    Aaz44990 P. alcali
    11   71.8    93.2      77   3  AAZ44944    Aaz44944 P. alcali
    12   71.4    92.7      74   3  AAZ88505    Aaz88505 P. alcali
    13   71.2    92.5      77   3  AAZ44966    Aaz44966 P. alcali
    14   70.6    91.7      77   3  AAZ449[?]0  Aaz449[?]0 P. alcali
    15   70.6    91.7   13382   3  AAZ44984    Aaz44984 P. alcali
    16   69.8    90.6      74   3  AAZ88516    Aaz88516 P. alcali
    17   69.8    90.6      77   3  AAZ44967    Aaz44967 P. alcali
    18   69.[?]  90.4      77  [?] AAZ44894    Aaz44894 P. alcali
    19   [?]     90.4      77  [?] AAZ44911    Aaz44911 P. alcali
    20   69.2    89.9      74   3  AAZ88503    Aaz88503 P. alcali
    21   68.6    89.1      77   3  AAZ44910    Aaz44910 P. alcali
    22   68.6    89.1      77   3  AAZ44915    Aaz44915 P. alcali
    23   68.6    89.1      77   3  AAZ449[?]9  [illegible] P. alcali
    24   [?]     87.[?]    74   3  AAZ885[?]0  Aaz885[?]0 P. alcali
    25   67      87.0      77   3  AAZ44954    Aaz44954 P. alcali
    26   66      85.7      77   3  AAZ449[?]7  Aaz449[?]7 P. alcali
    27   66      85.7   7[?]00 [?] AAZ44989    Aaz44989 P. alcali
    28   66      85.7   7[?]00  3  AAZ4498[?]  Aaz4498[?] P. alcali
    29   65.6    85.2      77   3  AAZ44964    Aaz44964 P. alcali
    30   64      83.[?]    78   3  AAZ449[?]8  Aaz449[?]8 P. alcali
    31   64      83.1      78   3  AAZ44940    Aaz44940 P. alcali
    32   62.4    81.0      73   3  AAZ885[?]1  Aaz885[?]1 P. alcali
    33   62.4    81.0      78   3  AAZ449[?]9  Aaz449[?]9 P. alcali
    34   61.8    80.3      78   3  AAZ449[?]7  Aaz449[?]7 P. alcali
    35   61.8    80.3      78   3  AAZ449[??]  Aaz449[??] P. alcali
    36   61.4    79.7      78   3  AAZ449[?]6  Aaz449[?]6 P. alcali
    37   61.4    79.7      78   3  AAZ449[?]7  Aaz449[?]7 P. alcali
    38   61      79.2      78   3  AAZ4496[?]  Aaz4496[?] P. alcali
    39   [?]     [?]     [?]   [?] AAZ4494[?]  [illegible] P. alcali


    40   59.8    77.7      75   3  AAZ88508  Aaz88508 P. alcali
    41   59.8    77.7      78   3  AAZ44907  Aaz44907 P. alcali
    42   58      75.3      74   3  AAZ88509  Aaz88509 P. alcali
    43   57.6    74.8      78   3  AAZ44956  Aaz44956 P. alcali
    44   56.8    73.8      78   3  AAZ44928  Aaz44928 P. alcali
    45   56.6    73.5      75   3  AAZ88514  Aaz88514 P. alcali


GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:          May 13, 2004, 20:14:57 ; Search time 110.853 Seconds
                 (without alignments)
                 385.475 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        682709 seqs, 277475446 residues

Total number of hits satisfying chosen parameters:   1365418

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       Issued_Patents_NA:*
                 1:  /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
                 2:  /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
                 3:  /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
                 4:  /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
                 5:  /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
                 6:  /cgn2_6/ptodata/2/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

                              SUMMARIES

Result          Query
   No.  Score   Match  Length   DB  ID                    Description

     1   24.4    31.7      576   4  US-09-252-991A-1507   Sequence 1507, Ap
c    2   24.4    31.7      876   4  US-09-252-991A-1565   Sequence 1565, Ap
     3   24.4    31.7     1512   4  US-09-252-991A-1551   Sequence 1551, Ap
c    4   24.4    31.7     1677   4  US-09-252-991A-1616   Sequence 1616, Ap
     5   24.2    31.4     3796   2  US-08-762-308-11      Sequence 11, Appl
     6   24.2    31.4     3796   4  US-09-844-634-10      Sequence 10, Appl
c    7   23.4    30.4    12665   4  US-08-961-527-134     Sequence 134, App
c    8   23      29.9     1212   4  US-09-134-000C-1452   Sequence 1452, Ap
     9   23      29.9     2124   4  US-09-252-991A-7474   Sequence 7474, Ap
    10   22.6    29.4    50341   1  US-08-247-901C-1      Sequence 1, Appli
    11   22.6    29.4    50341   2  US-09-075-904-1       Sequence 1, Appli
    12   22.6    29.4    52297   3  US-09-426-436-1       Sequence 1, Appli
    13   22.6    29.4    52297   4  US-0[?]-705-557-1     Sequence 1, Appli
    14   22.4    29.1     3870   1  US-08-118-641-1       Sequence 1, Appli
    15   22.4    29.1     3870   1  US-[illegible]        Sequence 1, Appli
    16   22.4    29.1     3893   1  US-08-118-641-1       Sequence 3, Appli
    17   22.4    29.1     3893   1  [illegible]           Sequence 3, Appli
    18   22.4    29.1  4403765   3  US-09-101-840A-2      Sequence 2, Appli
c   19   22.4    29.1  4403765   3  US-09-103-840A-2      Sequence 2, Appli
    20   22.4    29.1  4411529   3  US-09-103-840A-1      Sequence 1, Appli
c   21   22.4    29.1  4411529   3  US-09-103-840A-1      Sequence 1, Appli
c   22   22.2    28.8      549   4  US-09-252-991A-12922  Sequence 12922, A
c   23   22.2    28.8      774   4  US-09-107-532A-1786   Sequence 1786, Ap
    24   22.2    28.8     1308   4  US-09-252-991A-12869  Sequence 12869, A
c   25   22.2    28.8     1800  [?] US-08-260-202A-10     Sequence 10, Appl
c   26   22.2    28.8     1800   1  US-08-017-114-10      Sequence 10, Appl
c   27   22.2    28.8     1800   3  US-08-505-107-10      Sequence 10, Appl
c   28   22.2    28.8     1800   4  US-09-609-151A-10     Sequence 10, Appl
c   29   22
GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:          May 13, 2004, 21:09:03 ; Search time 441.422 Seconds
                 (without alignments)
                 791.600 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        2947324 seqs, 2269024515 residues

Total number of hits satisfying chosen parameters:   5894648

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       Published_Applications_NA:*
                 1:  /cgn2_6/ptodata/1/pubpna/US07_PUBCOMB.seq:*
                 2:  /cgn2_6/ptodata/1/pubpna/PCT_NEW_PUB.seq:*
                 3:  /cgn2_6/ptodata/1/pubpna/US06_NEW_PUB.seq:*
                 4:  /cgn2_6/ptodata/1/pubpna/US06_PUBCOMB.seq:*
                 5:  /cgn2_6/ptodata/1/pubpna/US07_NEW_PUB.seq:*
                 6:  /cgn2_6/ptodata/1/pubpna/PCTUS_PUBCOMB.seq:*
                 7:  /cgn2_6/ptodata/1/pubpna/US08_NEW_PUB.seq:*
                 8:  /cgn2_6/ptodata/1/pubpna/US08_PUBCOMB.seq:*
                 9:  /cgn2_6/ptodata/1/pubpna/US09A_PUBCOMB.seq:*
                 10: /cgn2_6/ptodata/1/pubpna/US09B_PUBCOMB.seq:
                 11: /cgn2_6/ptodata/1/pubpna/US09C_PUBCOMB.seq:
                 12: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq:*
                 13: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq2:
                 14: /cgn2_6/ptodata/1/pubpna/US10A_PUBCOMB.seq:
                 15: /cgn2_6/ptodata/1/pubpna/US10B_PUBCOMB.seq:
                 16: /cgn2_6/ptodata/1/pubpna/US10C_PUBCOMB.seq:*
                 17: /cgn2_6/ptodata/1/pubpna/US10_NEW_PUB.seq:*
                 18: /cgn2_6/ptodata/1/pubpna/US60_NEW_PUB.seq:*
                 19: /cgn2_6/ptodata/1/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:          May 13, 2004, 20:21:03 ; Search time 4145.39 Seconds
                 (without alignments)
                 554.686 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        27513289 seqs, 14931090276 residues

Total number of hits satisfying chosen parameters:   55026578

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :       EST:*
                 1:  em_estba:*
                 2:  em_esthum:*
                 3:  em_estin:*
                 4:  em_estmu:*
                 5:  em_estov:*
                 6:  em_estpl:*
                 7:  em_estro:*
                 8:  em_htc:*
                 9:  gb_est1:*
                 10: gb_est2:*
                 11: gb_htc:*
                 12: gb_est3:*
                 13: gb_est4:*
                 14: gb_est5:*
                 15: em_estfun:*
                 16: em_estom:*
                 17: em_gss_hum:*
                 18: em_gss_inv:*
                 19: em_gss_pln:*
                 20: em_gss_vrt:*
                 21: em_gss_fun:*
                 22: em_gss_mam:*
                 23: em_gss_mus:*
                 24: em_gss_pro:*
                 25: em_gss_rod:*
                 26: em_gss_phg:*
                 27: em_gss_vrl:*
                 28: gb_gss1:*
                 29: gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:          October 22, 2004, 04:50:28 ; Search time 1491 Seconds
                 (without alignments)
                 2442.191 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   OLIGO_NUC
                 Gapop 60.0 , Gapext 60.0

Searched:        4526729 seqs, 23644849745 residues
Word size :      0

Total number of hits satisfying chosen parameters:   2172512

Minimum DB seq length: 0
Maximum DB seq length: 100

Post-processing: Listing first 45 summaries

Database :       GenEmbl:*
                 1:  gb_ba:*
                 2:  gb_htg:*
                 3:  gb_in:*
                 4:  gb_om:*
                 5:  gb_ov:*
                 6:  gb_pat:*
                 7:  gb_ph:*
                 8:  gb_pl:*
                 9:  gb_pr:*
                 10: gb_ro:*
                 11: gb_sts:*
                 12: gb_sy:*
                 13: gb_un:*
                 14: gb_vi:*


Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
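
The oligo run above restricts database entries to 100 nucleotides or fewer (Maximum DB seq length: 100), matching the request to limit hits to short sequences. The sketch below illustrates that kind of length-limited search in Python. It assumes, as a simplification, that an oligo score can be approximated by the longest run of identical contiguous bases shared by the query and a database entry; that scoring rule, the function names, and the sample records are hypothetical and do not describe GenCore's OLIGO_NUC table.

```python
def longest_exact_match(query: str, target: str) -> int:
    """Length of the longest contiguous run shared by query and target."""
    best = 0
    prev = [0] * (len(target) + 1)
    for qc in query:
        cur = [0] * (len(target) + 1)
        for j, tc in enumerate(target, 1):
            if qc == tc:
                # Extend the run that ended at the previous base of both strings.
                cur[j] = prev[j - 1] + 1
                best = max(best, cur[j])
        prev = cur
    return best


def oligo_search(query: str, database: dict, max_db_len: int = 100):
    """Score every database entry no longer than max_db_len, best first."""
    hits = []
    for seq_id, seq in database.items():
        if len(seq) > max_db_len:
            continue  # entry exceeds the requested length limit
        hits.append((seq_id, longest_exact_match(query, seq)))
    hits.sort(key=lambda hit: hit[1], reverse=True)
    return hits
```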



                              SUMMARIES
                 %
Result          Query
   No.  Score   Match  Length  DB  ID        Description


or 


12 


54 


70 




77 


o 


ono 1 1 OQQ 
£>1/Z il£ 70 


1 T)QQ 
GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 04:25:35 ; Search time 240 Seconds 

(without alignments)
1684.189 Million cell updates/sec 

Title: US-09-701-626A-11 
Perfect score: 77 

Sequence: 1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77 



Scoring table: OLIGO_NUC 

Gapop 60.0 , Gapext 60.0 

Searched: 4134886 seqs, 2624710521 residues 



Word size : 0 

Total number of hits satisfying chosen parameters: 4343386 



Minimum DB seq length: 0 
Maximum DB seq length: 100 



Post-processing: Listing first 45 summaries 

Database :       N_Geneseq_23Sep04:*
                 1:  geneseqn1980s:*
                 2:  geneseqn1990s:*
                 3:  geneseqn2000s:*
                 4:  geneseqn2001as:*
                 5:  geneseqn2001bs:*
                 6:  geneseqn2002as:*
                 7:  geneseqn2002bs:*
                 8:  geneseqn2003as:*
                 9:  geneseqn2003bs:*
                 10: geneseqn2003cs:*
                 11: geneseqn2003ds:*
                 12: geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:          October 22, 2004, 05:21:44 ; Search time 63 Seconds
                 (without alignments)
                 868.742 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   OLIGO_NUC
                 Gapop 60.0 , Gapext 60.0

Searched:        824507 seqs, 355394441 residues
Word size :      0

Total number of hits satisfying chosen parameters:   1022532

Minimum DB seq length: 0
Maximum DB seq length: 100

Post-processing: Listing first 45 summaries

Database :       Issued_Patents_NA:*
                 1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                 2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                 3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                 4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                 5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                 6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

                              SUMMARIES

Result          Query
   No.  Score   Match  Length  DB  ID                    Description
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Oft O O O 


Q A 1 r™i « 

occjuence 


OO , AppX 


c 


1 7 
£. 3 




XO . 


c 
D 


4 u 


X 


TTC 

Uo - 


(J o 


A c a 


1 A A CC 
-344-00 


Sequence 


OO , Appl 


V— 


94 

Z *± 


1 9 
x z 


1 R 
X 3 . 


D 


4.9 


3 


TTQ - 
UO 


0 Q 
vj y 


o*t ^ 


- fl R7 - 9 1 
O O 3 Z X 




91 Annl 
Z X , AppX 




9 R 
z j 


1 9 
x z 


1 R 
X 3 . 


o 


R 7 

3 3 


*± 


TTQ - 
UO 


fifl 




-171TT-7417 
X / XC* 3flX3 


ocquencc 


7 4 17 Tar\ 
3 ft 1 3 , Ap 




9fi 


1 9 
x ^ 


1 R 

X 3 • 


D 


R"} 
3 3 


4 


TTQ - 
u o 


u o 


- 7 R 1 
/ox 


- QR fiZi - 74 1 7 
?ODn 3 *± X 3 


C A/^n 1 f* a 

ocquencc 


7 41 7 2ir^ 
3 ft X 3 / Ap 




9 7 
z / 


1 9 
x z 


1 R 

X 3 . 


D 


R4 

3 T± 


r* 


TTQ - 




~ Z X 3 


- PI R9 - 7 
U O Z 3 


Q ^ /T1 1 r™i « 

ocquencc 


3 , Appli. 


C 


9 fl 
z o 


-1 9 

X Z 


1 R 

ID . 


C 
D 


3 ft 


*± 


TTQ - 
UO 


n q 


-917 
Z X 3 


_ n R 9 -4 
U O Z - ft 


oc que nee 


ft , AppXx 


r* 


9 9 
z ^ 


1 9 

X Z 


1 R 
X o . 


D 


67 


*± 


TTQ - 

uo 


n q 


- R 1 7 
3X3 


_QQQP_1 n AH Q 

yyy\^ X / ft / o 


oequence 


1 1 A 1 Q 7i 

x /ft / 0 , A 






1 9 
xz 


1 R 
XO . 


a 
O 


1 u 


± 


TTC _ 

uo - 


n ci 
u 0 


_474 


_nni _ 1 71 
-UUX-X3X 


Sequence 


1 7 1 a«h 

X 3 1 , App 




3 X 


1 9 

xz 


1 R 
XO . 


c 
O 


i u 


X 


TTC - 
Uo - 


n Q 
u 0 


_ A 1 "3 
-*fc 3 3 


coc 17 1 
-O0O-X3I 


Sequence 


X 3 X , App 




7 9 
o z 


1 9 
XZ 


1 R 


O 


/ u 


X 


TTC - 
UO 


u O 


-414 


- flZ 0 ~ X3 X 


Sequence 


X 3 X , App 




7 7 
J 3 


1 9 
X z 


1 R 


o 


7 fi 


9 
z 


TTC - 
UO 


08 


-437 


-C^7_1 71 
— DO/-X3X 


Sequence 


17 1 Ann 

X 3 X , App 


c 


7 A 
3 ft 


1 9 
XZ 


13 . 


O 


/ u 


Z 


TTC 

Uo - 


08 


-894 


-0 / y-114 


Sequence 


114 , App 




7 R 
J 3 


1 9 
X Z 


1 R 
XO . 


C 
D 


1 u 


-3 
3 


TTC 
Uo - 


08 


-906 


QC c 17 1 

-yoo-xjx 


Sequence 


X 3 X , App 




7 £T 
3 D 


1 9 
X Z 


1 R 
XO . 


/: 

D 


1 U 


3 


TTC 

Uo - 


08 


-945 


O A G 1 T 1 

- y U y - X3 1 


Sequence 


131, App 




j / 


1 9 
X z 


1 R 


D 


7 n 


3 


TTQ - 
UO 


09 


-396 


.nnoB-i 71 

UUZrt 1 j X 


oequence 


17 1 Ann 

X 3 X t App 




^ fl 
J o 


1 9 
X z 


1 R 


£L 

o 


7 n 




TTC - 
UO 


10 


-077 


-71 Q-1 71 


oequence 


17 1 Ann 

X 3 X , App 




    39   12      15.6      70   5  PCT-US96-[illegible]  Sequence 131, App
    40   12      15.6      82   4  US-09-513-999C-35076  Sequence 35076, A
c   41   12      15.6      91   4  US-09-513-999C-16283  Sequence 16283, A
    42   12      15.6      94   4  US-09-621-976-13958   Sequence 13958, A
    43   11      14.3      11   4  US-09-723-942-31      Sequence 31, Appl
    44   11      14.3      15   1  US-08-182-968A-331    Sequence 331, App
    45   11      14.3      15   1  US-08-182-968A-333    Sequence 333, App


GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:          October 22, 2004, 05:58:14 ; Search time 242 Seconds
                 (without alignments)
                 1630.339 Million cell updates/sec

Title:           US-09-701-626A-11
Perfect score:   77
Sequence:        1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:   OLIGO_NUC
                 Gapop 60.0 , Gapext 60.0

Searched:        3407233 seqs, 2561960514 residues
Word size :

Total number of hits satisfying chosen parameters:   2126116

Minimum DB seq length: 0
Maximum DB seq length: 100

Post-processing: Listing first 45 summaries

Database :       Published_Applications_NA:*
                 1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                 2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                 3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                 4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                 5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                 6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:
                 7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                 8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                 9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:
                 10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq
                 11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq
                 12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:
                 13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq
                 14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq
                 15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq
                 16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq
                 17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq
                 18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:
                 19: /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:
                 20: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:
                 21: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



                              SUMMARIES

Result          Query
   No.  Score   Match  Length  DB  ID                    Description





1 


19 


24 


9 7 


34 


15 


US -10-407-637-29 


Sequence 


29, Appl 




    2    [?]    [?]      25   15  US-10-098-263B-95573    Sequence 95573, A
    3     13   16.9      25   15  US-10-098-263B-122128   Sequence 122128,
    4     13   16.9      97   10  US-09-8[?]1-486-76      Sequence 76, Appl
    5     13   16.9      97   15  US-10-223-666-165       Sequence 165, App
    6     12   15.6      15    9  US-09-504-231A-354      Sequence 354, App
    7     12   15.6      15    9  US-09-274-[?]-354       Sequence 354, App
    8     12   15.6      17   10  US-09-740-332-1504      Sequence 1504, Ap
    9     12   15.6      17   10  US-09-740-332-3051      Sequence 3051, Ap
   10     12   15.6      17  [?]  [?]                     [?]
   11     12   15.6      17   10  [?]                     [?]
   12     12   15.6      17   17  [?]                     [?]
   13     12   15.6      17   17  [?]                     [?]
   14     12   15.6      22  [?]  [?]                     [?]
c  15     12   15.6      22  [?]  [?]                     [?]
   16     12   15.6      22  [?]  [?]                     [?]
   17     12   15.6      23   14  [?]                     Sequence 12, Appl
   18     12   15.6      23  [?]  [?]                     Sequence 12, Appl
   19     12   15.6      25   15  US-10-098-263B-[?]      [?]
c  20     12   15.6      25   15  US-10-098-263B-[?]      [?]
   21     12   15.6      25   15  US-10-098-263B-[?]      [?]
   22     12   15.6      25   15  US-10-098-263B-[?]      [?]
c  23     12   15.6      25   15  US-10-098-263B-[?]      [?]
c  24     12   15.6      25   15  US-10-098-263B-122127   Sequence 122127,
   25     12   15.6      25   15  US-10-098-263B-[?]      [?]
   26     12   15.6      25   15  US-10-098-263B-128968   Sequence 128968,
c  27     12   15.6      31  [?]  [?]                     [?]
   28     12   15.6      31   16  [?]                     [?]
c  29     12   15.6      31   17  [?]                     [?]
   30     12   15.6      32  [?]  [?]                     Sequence 50, Appl
   31     12   15.6      32    9  [?]                     Sequence 10, Appl
   32     12   15.6      32   14  [?]                     Sequence 10, Appl
   33     12   15.6      32  [?]  [?]                     Sequence 50, Appl
   34     12   15.6      32  [?]  [?]                     Sequence 53, Appl
   35     12   15.6     [?]  [?]  [?]                     Sequence 55, Appl
   36     12   15.6      37   14  US-10-002-784A-20       Sequence 20, Appl
c  37     12   15.6      37   14  US-10-002-784A-21       Sequence 21, Appl
c  38     12   15.6      42   16  US-10-082-714-21        Sequence 21, Appl
   39     12   15.6      50   16  US-10-131-827-1507      Sequence 1507, Ap
   40     12   15.6      53    8  US-08-781-986A-3413     Sequence 3413, Ap
   41     12   15.6      53   16  US-10-329-624-3413      Sequence 3413, Ap
   42     12   15.6      54   14  US-10-061-979-3         Sequence 3, Appli
c  43     12   15.6      54   14  US-10-061-979-4         Sequence 4, Appli
   44     12   15.6      60   10  US-09-908-975-6734      Sequence 6734, Ap
c  45     12   15.6      60   10  US-09-908-975-12898     Sequence 12898, A

GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 05:20:29 ; Search time 1633 Seconds
                (without alignments)
                1718.224 Million cell updates/sec

Title:          US-09-701-626A-11
Perfect score:  77
Sequence:       1 acctaacaactggttcaagt cccttaaccaaacgttaggc 77

Scoring table:  OLIGO_NUC
                Gapop 60.0 , Gapext 60.0

Searched:       32822875 seqs, 18219865908 residues

Word size :     0

Total number of hits satisfying chosen parameters:      664238

Minimum DB seq length: 0
Maximum DB seq length: 100

Post-processing: Listing first 45 summaries

Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES

Result        Query
  No.  Score  Match  Length   DB  ID                      Description

c   1     14   18.2      93    4  BG075044                BG075044 H3142F04-
    2     13   16.9      52    1  AA823097                AA823097 vw40f06.r
    3     13   16.9      68    9  CL213924                CL213924 W204C06 G
c   4     13   16.9      75    1  AU259835                AU259835 AU259835
c   5     13   16.9      92    9  AB082041                AB082041 Drosophil
c   6     12   15.6      32    2  BF131807                BF131807 601820724
    7     12   15.6      34    1  AA939151                AA939151 oq07e11.s
c   8     12   15.6      46    1  AI000352                AI000352 ov11d10.s
    9     12   15.6      47    8  BZ665531                BZ665531 EY00954-3
   10     12   15.6      49    1  AI697225                AI697225 [?]
   11     12   15.6      49    9  BX661579                BX661579 [?]
c  12     12   15.6      50    1  AU106001                AU106001 AU106001
c  13     12   15.6      51    7  CN850199                CN850199 [?]
c  14     12   15.6      53    4  [?]                     [?]
   15     12   15.6      54    7  CR425327                CR425327 [?]
c  16     12   15.6      58    4  BI49[?]871              [?]
c  17     12   15.6      59    9  CR275870                CR275870 [?]
   18     12   15.6      59    9  CG466079                CG466079 [?]
   19     12   15.6      60    4  BG099767                BG099767 [?]
c  20     12   15.6      60    9  CG408854                CG408854 [?]
   21     12   15.6      61    3  BZ661780                BZ661780 [?]
c  22     12   15.6      62    9  CL246534                CL246534 [?]
   23     12   15.6      66    2  BF130886                BF130886 601819680
   24     12   15.6      66    8  AZ448881                AZ448881 [?]
   25     12   15.6      66    9  BX661[?]78              [?]
c  26     12   15.6      67    9  BX960820                BX960820 [?]
   27     12   15.6      69    9  [?]                     [?]
c  28     12   15.6      72    9  [?]                     [?]
c  29     12   15.6      73  [?]  [?]                     [?]
c  30     12   15.6      74    9  [?]                     [?]
   31     12   15.6      75    1  [?]                     [?]
   32     12   15.6      77    3  [?]                     [?]
   33     12   15.6      78    2  AW250424                AW250424 [?]
   34     12   15.6      78  [?]  CD289266                CD289266 [?]
   35     12   15.6      78    8  AF149686                AF149686 AF149686
c  36     12   15.6      78    3  AZ991778                AZ991778 [?]
c  37     12   15.6      83    3  BZ778495                BZ778495 [?]
c  38     12   15.6      85    3  AZ608291                AZ608291 [?]
   39     12   15.6      87    9  CR397846                CR397846 Arabidops
   40     12   15.6      88    1  AA104061                AA104061 mp01d10.r
   41     12   15.6      88    9  CR397847                CR397847 Arabidops
c  42     12   15.6      89    8  BZ593785                BZ593785 SALK_0820
   43     12   15.6      90    9  AL943231                AL943231 Arabidops
   44     12   15.6      92    1  AA595539                AA595539 ni47a05.s
   45     12   15.6      92    9  AL946063                AL946063 Arabidops
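
The "Query Match" column in the summaries above appears to be the hit score expressed as a percentage of the perfect score (77 for this query), so a score of 14 gives 14/77, or about 18.2%. A minimal Python sketch of that arithmetic follows. The function name `query_match` is hypothetical and is not part of GenCore; the rounding to one decimal place is assumed from the printed values.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Return the hit score as a percentage of the perfect score, to one decimal."""
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


# Scores 14, 13 and 12 against a perfect score of 77, as in the summaries above.
for s in (14, 13, 12):
    print(s, query_match(s, 77))
```

The same relation explains why the later searches, which keep only hits with a minimum and maximum match of 100%, list only hits whose score equals the perfect score of 43.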



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:27:10 ; Search time 823.508 Seconds
                (without alignments)
                2469.262 Million cell updates/sec

Title:          US-09-701-626A-80
Perfect score:  43
Sequence:       1 taacaattggttcaagtcgt cttcgctcactgcgggaccg 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
               10:  gb_ro:*
               11:  gb_sts:*
               12:  gb_sy:*
               13:  gb_un:*
               14:  gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43    6  BD211356                BD211356 Method of
    2     43  100.0      78    6  BD211314                BD211314 Method of
    3     43  100.0      78    6  BD211323                BD211323 Method of
    4     43  100.0    7300    6  BD211279                BD211279 Method of
    5     43  100.0   42143    6  BD211280                BD211280 Method of
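
The throughput figure in each run header can be checked by hand. The reported values are consistent with one cell update per query residue per database residue, doubled. The doubling presumably reflects searching both strands, but that is an inference from the numbers and is not stated in the report. A minimal Python sketch under that assumption, with a hypothetical function name:

```python
def million_cell_updates_per_sec(query_len: int, db_residues: int,
                                 seconds: float, strands: int = 2) -> float:
    """Estimate throughput in millions of cell updates per second.

    Assumes one cell update per (query residue, database residue) pair,
    multiplied by the number of strands searched (assumed to be 2).
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return strands * query_len * db_residues / seconds / 1e6


# GenEmbl run for US-09-701-626A-80: 43-residue query,
# 23644849745 database residues, 823.508 seconds.
rate = million_cell_updates_per_sec(43, 23644849745, 823.508)
print(f"{rate:.3f} Million cell updates/sec")
```

For the GenEmbl run this comes out within rounding of the 2469.262 printed in the header, and the same formula reproduces the Geneseq figure (1135.840) to within about 0.002.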



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:02:20 ; Search time 198.73 Seconds
                (without alignments)
                1135.840 Million cell updates/sec

Title:          US-09-701-626A-80
Perfect score:  43
Sequence:       1 taacaattggttcaagtcgt cttcgctcactgcgggaccg 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4134886 seqs, 2624710521 residues

Total number of hits satisfying chosen parameters:      6

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
               10:  geneseqn2003cs:*
               11:  geneseqn2003ds:*
               12:  geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43    3  AAZ44969                Aaz44969 P. alcali
    2     43  100.0      78    3  AAZ44936                Aaz44936 P. alcali
    3     43  100.0      78    3  AAZ44927                Aaz44927 P. alcali
    4     43  100.0    7300    3  AAZ44982                Aaz44982 P. alcali
    5     43  100.0    7300    3  AAZ44983                Aaz44983 P. alcali
    6     43  100.0   13382    3  AAZ44984                Aaz44984 P. alcali



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:55:41 ; Search time 42.1865 Seconds
                (without alignments)
                724.496 Million cell updates/sec

Title:          US-09-701-626A-80
Perfect score:  43
Sequence:       1 taacaattggttcaagtcgt cttcgctcactgcgggaccg 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:59:51 ; Search time 172.814 Seconds
                (without alignments)
                1274.950 Million cell updates/sec

Title:          US-09-701-626A-80
Perfect score:  43
Sequence:       1 taacaattggttcaagtcgt cttcgctcactgcgggaccg 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
               10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
               11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
               12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
               13:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
               14:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
               15:  /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
               16:  /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
               17:  /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
               18:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
               19:  /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
               20:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
               21:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 

score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found



Run on:         October 22, 2004, 06:54:36 ; Search time 1764.05 Seconds
                (without alignments)
                888.247 Million cell updates/sec

Title:          US-09-701-626A-80
Perfect score:  43
Sequence:       1 taacaattggttcaagtcgt cttcgctcactgcgggaccg 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 13:39:40
Job time : 1764.05 secs



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:27:10 ; Search time 823.508 Seconds
                (without alignments)
                2469.262 Million cell updates/sec

Title:          US-09-701-626A-81
Perfect score:  43
Sequence:       1 taactattcagtcaagcgga aaaccccgctgcgcggtctt 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues



Total number of hits satisfying chosen parameters: 5 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
               10:  gb_ro:*
               11:  gb_sts:*
               12:  gb_sy:*
               13:  gb_un:*
               14:  gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43    6  BD211357                BD211357 Method of
    2     43  100.0      72    6  BD211340                BD211340 Method of
    3     43  100.0   14143    6  BD211277                BD211277 Method of
    4     43  100.0   14143    6  BD211278                BD211278 Method of
    5     43  100.0   42143    6  BD211280                BD211280 Method of



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:02:20 ; Search time 198.73 Seconds
                (without alignments)
                1135.840 Million cell updates/sec

Title:          US-09-701-626A-81
Perfect score:  43
Sequence:       1 taactattcagtcaagcgga aaaccccgctgcgcggtctt 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4134886 seqs, 2624710521 residues



Total number of hits satisfying chosen parameters: 3 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
               10:  geneseqn2003cs:*
               11:  geneseqn2003ds:*
               12:  geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43    3  AAZ44970                Aaz44970 P. alcali
    2     43  100.0      72    3  AAZ44953                Aaz44953 P. alcali
    3     43  100.0   14143    3  AAZ44981                Aaz44981 P. alcali



Run on:         October 22, 2004, 06:55:41 ; Search time 42.1865 Seconds
                (without alignments)
                724.496 Million cell updates/sec

Title:          US-09-701-626A-81
Perfect score:  43
Sequence:       1 taactattcagtcaagcgga aaaccccgctgcgcggtctt 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 13:45:50
Job time : 42.1865 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:59:51 ; Search time 172.814 Seconds
                (without alignments)
                1274.950 Million cell updates/sec

Title:          US-09-701-626A-81
Perfect score:  43
Sequence:       1 taactattcagtcaagcgga aaaccccgctgcgcggtctt 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters:      0



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
               10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
               11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
               12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
               13:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
               14:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
               15:  /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
               16:  /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
               17:  /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
               18:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
               19:  /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
               20:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
               21:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 14:10:43
Job time : 172.814 secs



GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:54:36 ; Search time 1764.05 Seconds
                (without alignments)
                888.247 Million cell updates/sec

Title:          US-09-701-626A-81
Perfect score:  43
Sequence:       1 taactattcagtcaagcgga aaaccccgctgcgcggtctt 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 13:39:40
Job time : 1764.05 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:27:10 ; Search time 823.508 Seconds
                (without alignments)
                2469.262 Million cell updates/sec

Title:          US-09-701-626A-82
Perfect score:  43
Sequence:       1 taacaatgcgctcaactgcg acttcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues



Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
               10:  gb_ro:*
               11:  gb_sts:*
               12:  gb_sy:*
               13:  gb_un:*
               14:  gb_vi:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43  [?]  BD211358                BD211358 Method of



Run on:         October 22, 2004, 06:02:20 ; Search time 198.73 Seconds
                (without alignments)
                1135.840 Million cell updates/sec

Title:          US-09-701-626A-82
Perfect score:  43
Sequence:       1 taacaatgcgctcaactgcg acttcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4134886 seqs, 2624710521 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
               10:  geneseqn2003cs:*
               11:  geneseqn2003ds:*
               12:  geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

    1     43  100.0      43    3  AAZ44971                Aaz44971 P. alcali

Search completed: October 22, 2004, 07:28:21
Job time : 198.73 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:55:41 ; Search time 42.1865 Seconds
                (without alignments)
                724.496 Million cell updates/sec

Title:          US-09-701-626A-82
Perfect score:  43
Sequence:       1 taacaatgcgctcaactgcg acttcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 13:45:50
Job time : 42.1865 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:59:51 ; Search time 172.814 Seconds
                (without alignments)
                1274.950 Million cell updates/sec

Title:          US-09-701-626A-82
Perfect score:  43
Sequence:       1 taacaatgcgctcaactgcg acttcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters:      0



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
               10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
               11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
               12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
               13:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
               14:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
               15:  /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
               16:  /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
               17:  /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
               18:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
               19:  /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
               20:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
               21:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 14:10:43
Job time : 172.814 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:54:36 ; Search time 1764.05 Seconds
                (without alignments)
                888.247 Million cell updates/sec

Title:          US-09-701-626A-82
Perfect score:  43
Sequence:       1 taacaatgcgctcaactgcg acttcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                %
Result        Query
  No.  Score  Match  Length   DB  ID                      Description

No matches found

Search completed: October 22, 2004, 13:39:40
Job time : 1764.05 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on:         October 22, 2004, 06:27:10 ; Search time 823.508 Seconds
                (without alignments)
                2469.262 Million cell updates/sec

Title:          US-09-701-626A-83
Perfect score:  43
Sequence:       1 taacaagtcgctcaactgcc cactcgttcgctggacagcc 43

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database : GenEmbl:*

1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

1       43      100.0   43      6   BD211359    BD211359 Method of



Run on: October 22, 2004, 06:02:20 ; Search time 198.73 Seconds
(without alignments)
1135.840 Million cell updates/sec

Title: US-09-701-626A-83
Perfect score: 43
Sequence: 1 taacaagtcgctcaactgcc cactcgttcgctggacagcc 43

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 4134886 seqs, 2624710521 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : N_Geneseq_23Sep04:*

1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002as:*
7: geneseqn2002bs:*
8: geneseqn2003as:*
9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

        43      100.0   43          AAZ44972    Aaz44972 P. alcali



Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: October 22, 2004, 06:55:41 ; Search time 42.1865 Seconds
(without alignments)
724.496 Million cell updates/sec

Title: US-09-701-626A-83
Perfect score: 43
Sequence: 1 taacaagtcgctcaactgcc cactcgttcgctggacagcc 43

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database : Issued_Patents_NA:*

1: /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/1/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

No matches found

Search completed: October 22, 2004, 13:45:50
Job time : 42.1865 secs

GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: October 22, 2004, 06:59:51 ; Search time 172.814 Seconds
(without alignments)
1274.950 Million cell updates/sec

Title: US-09-701-626A-83
Perfect score: 43
Sequence: 1 taacaagtcgctcaactgcc cactcgttcgctggacagcc 43

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters: 0

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
Maximum Match 100%
Listing first 1000 summaries



Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
19: /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
20: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
21: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

No matches found



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 07:28:32 ; Search time 1482 Seconds
(without alignments)
606.278 Million cell updates/sec

Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters: 51

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
Maximum Match 100%
Listing first 1000 summaries

Database : GenEmbl:*

1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

1       19      100.0   19      6   BD211360    BD211360 Method of
2       19      100.0   31      6   BD211366    BD211366 Method of
3       19      100.0   52      6   BD211355    BD211355 Method of
4       19      100.0   64      6   BD211394    BD211394 Method of



D 


1 Q 

X y 


inn 


n 


74 
/ *± 


£ 
O 


Dno 11^74 


P.T19 1 1 7 74 
DJJZ X X J / ft 


nccnou 01 


£ 


1 Q 

X y 


inn 

X \J u • 


r\ 

. \j 






nno i 1 7 1 n 

oU£t X X J X u 


P.T19 1 1 7 i n 
DJJZ 11J1U 


rJct-iiuu. Ox 


7 


X -? 


inn 


a 

■ \j 


7 




dho 1 1 9 on 


P.T19 119 on 

DJJZ X XZ -7 VJ 


rJcciKJU. 01 


o 
o 


x ^ 


inn 

X U U • 


n 

. \j 


7 




RT19 1 1 ^ n 1 


nno 1 1 7 fl 1 
DJJZ 11JU1 




Q 


1 Q 
x y 


i nn 


r\ 

, u 


7 £ 
/ D 




T3T^9 1 1 1 1 ft 
DUZ X X J X 0 


dJJz llj lo 


Method of 


1 ft 


1 Q 


i nn 


U 


7 £ 


£ 
D 




dUz llj J b 


Method of 


1 1 

X X 


1 Q 

X J 


inn 


ft 


7 £ 


O 


DJJZ X X J ft 0 


DJJZ 1 1 Jfl O 


Method of 


1 9 
X z 


1 Q 

X J7 


inn 
x u u . 


u 


7 7 


D 


dUz X XZ 0 / 


DJJZ 1 lZ O / 


Method of 


-L J 


1 9 

X J 


inn 


n 
w 


7 7 




*Rri9 1 1 9 


DJJz 1 1Z yo 


Method of 


1 4 

X*± 


1 Q 
x y 


inn 




7 7 




PTJ9 1 1 9 Q7 
DJJZ X X Z 27 / 


djjz 1 iz y / 


Method of 


15 


1 9 
J. y 


inn 




7 7 




RT»9 119 9ft 
DIJ^ XXZ 


DJJZ 11Z70 


rieuilOQ OX 


16 


19 


100 . 


0 


77 




RH9 1 1 ^nn 


onn 1 1 7 n n 

DJJZ IIJUU 


ft/I ^ fr* V» /"\ /™\^ 
rJcuilOU OX 


17 


1 9 


inn 

x \j \j • 




7 7 


£ 


PF>9 1 1 709 


DJJZ IIjUZ 


lYjecnoa ox 


18 


19 


100 . 


0 


77 


£ 


RD9 1 1 ^ 1 

LJLJ XXJ XD 


RD9 1 1 7 1 
DJJZ lljlD 


ric LilUU. OX 


19 


1 9 


inn 


n 


77 


£ 


RD9 11717 


nn9 11717 

DUZ lljl / 


nccnuu ox 


2 0 


1 9 
J. y 


i nn 


n 

Lr 


77 


£ 


RH9 117 94 


131^9 117 94 
DJJZ 1 1 J Zft 


ft/I^^Vi/-<r^ 

iMctnou or 


2 1 


19 


100 . 


n 


7 7 


£ 


RH9 117 71 

DiJZ X X O J X 


HFtO 117 7 1 
DJJZ llj jl 


rieunoQ ox 


22 


1 9 
-L y 


i nn 

X U \J * 


n 

u 


7 7 


£ 


"RF19 117 7 9 


DiJZ iijjy 


Method of 


Z O 


1 9 
x y 


inn 


n 


7 7 


£ 
O 


PFl9 11741 


DJJZ 11J4 1 


Method of 


z *± 


1 9 
x y 


inn 

J.UU . 




7 7 


£ 


T5rt9 1 1 7 A A 


DJJZ llo44 


Mecnoa ox 


25 


1 9 


inn 


n 


7 7 


£ 


T3FJ9 1 1 7 R1 


T7F»9 1 1 OC1 
DJJZ lljjl 


inecnoa ox 


26 


1 9 

X ^ 


inn 


o 


7 7 


£ 


Tjno 117^7 

DiJZ X X J O j 


DflO 1 1 1 C *3 

djJz HJdj 


riecfioa ox 


27 


1 9 


inn 


n 


77 


£ 


PD9 1 1 7 R4 


nno 1 1 7 ca 
DJJz 1 lO Z>ft 


jyieunoa ox 


28 


19 


100 . 


o 


77 


£ 


RD9 117 9^ 

DUZ X X J J J 


■DTlO 1 1 7 QC 
DJJZ llj 7 J 


rieuiiOQ. ox 


2 9 


19 


100 . 


n 


7 7 


£ 


^1^9 117 9ft 

DJJZ X X J -7 O 


DJJZ 11J70 


Method of 


3 0 


19 


100 . 


n 


7 ft 
/ o 


£ 


PT19 119 Q4 
DU Z X X Z _? *± 


djjz 1 iz yft 


Method of 


3 1 


1 9 


i nn 


n 


7 ft 
/ o 


£ 


DD7 1 1 9 Q Q 
OUZ X XZ J7 j 


DJJZ 1 IZ y y 


Method of 


32 


1 9 

X ZJ 


i nn 

1 w U t 


o 


7 ft 
/ o 


£ 




TDp>n 1 1 1 ft Q 

djjz lijuy 


Method of 




1 9 

X J 


i nn 




7 ft 


£ 
D 


■R"n9 117 14 
J3LJZ llj X^± 


nnn 1111/1 

dJJZ 1 1 j Ifl 


Method of 


"34 
o ■* 


1 9 
x y 


inn 

X u u . 


n 


7 ft 
/ O 


£ 


pt»9 1 1 ion 

BUZ X X J Z U 


dJJz lljzU 


Mecnoa or 


7 c 
j zj 


1 9 
x y 


inn 


u 


7 ft 


£ 


JDUZ X X J Z J 


dJJz 11 j 


Metrioa or 


3 6 


1 9 
x y 


i nn 




7 ft 


£ 






Metnoa or 


3 7 


19 


inn 


n 


7 ft 


£ 


■RD9 117 9 7 
DJJZ X X J Z / 


Dm 1 1 1 0 t 

DiJZ llJZ / 


Method of 


3 8 


19 


100 . 


n 


7 ft 
/ o 


£ 


pri9 1 1 7 7 r 

-DU Z X X O J O 


nm 1 i 1 1 c 

DJJZ 11 J J J 


Method of 




1 9 

X -7 


inn 

X \J u . 




7 ft 
/ o 


£ 
O 


PTi9 11747 
13 UZ X X J 


nno 111/11 

djjZ 11 J4 3 


Metnoa or 


4 0 


1 9 

X -7 


inn 

X VJ U . 




7 ft 


£ 
O 


pn9 1 1 4 n c; 

DJJZ X X*± U D 


"DT^O 1 1 A f\ C 

dJJz 1 14 Ub 


Metnoa ox 


41 


19 


inn 


n 


7 ft 


£ 


DJJZ lliUO 


Dm 1 1 A C\ c 
DJJZ 1 14 U D 


Method of 


42 


1 9 
x y 


inn 




7 9 


£ 


"RTI9 117 9ft 
DJJZ lljZO 


DT^ll IIIOO 
DJJZ 1 1-5 Z O 


Mecnoa or 


43 


19 


inn 

x v v • 


n 


zj y zj i 


X 


M.X 1Z7J J?Z 


A VI 1 Q 1 QO 

Ai iz y 0 y z 


Pseudomon 


44 


1 9 

X J 


i nn 

X \J \J * 


\j 


797c 


T 
X 


BV1 9 Q 7 Q 7 
M I X Z _7 ,3 -7 J 


J\V1 1Q1Q1 

Ai iz y j y j 


Pseudomon 


4 1; 

ft 3 


1 9 
x y 


i nn 




/ jUU 


c 
0 


Dijz x xz / y 


BD2 11279 


Method of 


46      19      100.0   9581    1   PAE223604   AJ223604 Pseudomon
47      19      100.0   11456   1   AE015694    AE015694 Shewanell
48      19      100.0   14143   6   BD211277    BD211277 Method of
49      19      100.0   14143   6   BD211278    BD211278 Method of
50      19      100.0   18042   1   AY038186    AY038186 Pseudomon
51      19      100.0   42143   6   BD211280    BD211280 Method of



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 09:26:43 ; Search time 235 Seconds 

(without alignments) 
424.421 Million cell updates/sec 

Title: US-09-701-626A-84

Perfect score: 19

Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 

Searched: 4134886 seqs, 2624710521 residues 



Total number of hits satisfying chosen parameters: 54 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : N_Geneseq_23Sep04:*

1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002as:*
7: geneseqn2002bs:*
8: geneseqn2003as:*
9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

1       19      100.0   19      3   AAZ44973    Aaz44973 P. alcali
2       19      100.0   31      3   AAZ44979    Aaz44979 P. alcali
3       19      100.0   34      10  ADH19155    Adh19155 Single-st
4       19      100.0   52
GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: May 13, 2004, 19:34:52 ; Search time 313.172 Seconds
(without alignments)
2629.598 Million cell updates/sec

Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3470272 seqs, 21671516995 residues

Total number of hits satisfying chosen parameters: 6940544

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : GenEmbl:*

1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*
15: em_ba:*
16: em_fun:*
17: em_hum:*
18: em_in:*
19: em_mu:*
20: em_om:*
21: em_or:*
22: em_ov:*
23: em_pat:*
24: em_ph:*
25: em_pl:*
26: em_ro:*
27: em_sts:*
28: em_un:*
29: em_vi:*
30: em_htg_hum:*
31: em_htg_inv:*
32: em_htg_other:*
33: em_htg_mus:*
34: em_htg_pln:*
35: em_htg_rod:*
36: em_htg_mam:*
37: em_htg_vrt:*
38: em_sy:*
39: em_htgo_hum:*
40: em_htgo_mus:*
41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
42      19
43      19      100.0
AY129392    AY129392 Pseudomon
44
Pseudomon
45
Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3373863 seqs, 2124099041 residues

Total number of hits satisfying chosen parameters: 6747726

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : N_Geneseq_29Jan04:*

1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002s:*
7: geneseqn2003as:*
8: geneseqn2003bs:*
9: geneseqn2003cs:*
10: geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

1       19      100.0   19      3   AAZ44973    Aaz44973 P. alcali
19 100.
31
Aaz44979 P. alcali
19 100.
52 3
Aaz44968 P. alcali
74
P. alcali
P. alcali
alcali
GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: May 13, 2004, 20:14:57 ; Search time 27.3534 Seconds
(without alignments)
385.475 Million cell updates/sec

Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 682709 seqs, 277475446 residues

Total number of hits satisfying chosen parameters: 1365418

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : Issued_Patents_NA:*

1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2947324 seqs, 2269024515 residues

Total number of hits satisfying chosen parameters: 5894648

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/1/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/1/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/1/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/1/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/1/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/1/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/1/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/1/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/1/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/1/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/1/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/1/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/1/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/1/pubpna/US10C_PUBCOMB.seq:*
17: /cgn2_6/ptodata/1/pubpna/US10_NEW_PUB.seq:*
18: /cgn2_6/ptodata/1/pubpna/US60_NEW_PUB.seq:*
19: /cgn2_6/ptodata/1/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: May 13, 2004, 20:21:03 ; Search time 1022.89 Seconds
(without alignments)
554.686 Million cell updates/sec

Title: US-09-701-626A-84
Perfect score: 19
Sequence: 1 gccccttaaccaaacgtta 19

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 27513289 seqs, 14931090276 residues

Total number of hits satisfying chosen parameters: 55026578

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : EST:*

1: em_estba:*
2: em_esthum:*
3: em_estin:*
4: em_estmu:*
5: em_estov:*
6: em_estpl:*
7: em_estro:*
8: em_htc:*
9: gb_est1:*
10: gb_est2:*
11: gb_htc:*
12: gb_est3:*
13: gb_est4:*
14: gb_est5:*
15: em_estfun:*
16: em_estom:*
17: em_gss_hum:*
18: em_gss_inv:*
19: em_gss_pln:*
20: em_gss_vrt:*
21: em_gss_fun:*
22: em_gss_mam:*
23: em_gss_mus:*
24: em_gss_pro:*
25: em_gss_rod:*
26: em_gss_phg:*
27: em_gss_vrl:*
28: gb_gss1:*
29: gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
GenCore version 5.1.6
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: May 13, 2004, 19:34:52 ; Search time 329.655 Seconds
(without alignments)
2629.598 Million cell updates/sec

Title: US-09-701-626A-85
Perfect score: 20
Sequence: 1 ccgagtgagcgaagcgagcg 20

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3470272 seqs, 21671516995 residues

Total number of hits satisfying chosen parameters: 6940544

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries



Database : GenEmbl:*

1: gb_ba:*
2: gb_htg:*
3: gb_in:*
4: gb_om:*
5: gb_ov:*
6: gb_pat:*
7: gb_ph:*
8: gb_pl:*
9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*
15: em_ba:*
16: em_fun:*
17: em_hum:*
18: em_in:*
19: em_mu:*
20: em_om:*
21: em_or:*
22: em_ov:*
23: em_pat:*
24: em_ph:*
25: em_pl:*
26: em_ro:*
27: em_sts:*
28: em_un:*
36: em_htg_mam:*
37: em_htg_vrt:*
38: em_sy:*
39: em_htgo_hum:*
40: em_htgo_mus:*
41: em_htgo_other:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description
BD211319 Method of
BD211338 Method of
BD211283 Method of
BD211326 Method of
AY129393 Pseudomon
BD211279 Method of
X68367 M.thermofor
X68366 M.thermofor
AE015934 Staphyloc
AB125341 Staphyloc
AC131447 Strongylo



GenCore version 5.1.6 

Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: May 13, 2004, 18:33:43 ; Search time 125.862 Seconds 

(without alignments) 
675.056 Million cell updates/sec 

Title: US-09-701-626A-85
Perfect score: 20
Sequence: 1 ccgagtgagcgaagcgagcg 20

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3373863 seqs, 2124099041 residues

Total number of hits satisfying chosen parameters: 6747726

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : N_Geneseq_29Jan04:*

1: geneseqn1980s:*
2: geneseqn1990s:*
3: geneseqn2000s:*
4: geneseqn2001as:*
5: geneseqn2001bs:*
6: geneseqn2002s:*
7: geneseqn2003as:*
8: geneseqn2003bs:*
9: geneseqn2003cs:*
10: geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
385.475 Million cell updates/sec

Title: US-09-701-626A-85
Perfect score: 20
Sequence: 1 ccgagtgagcgaagcgagcg 20

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 682709 seqs, 277475446 residues

Total number of hits satisfying chosen parameters: 1365418

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Issued_Patents_NA:*

1: /cgn2_6/ptodata/2/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/2/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/2/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/2/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/2/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/2/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
Result          Query
No.     Score   Match   Length  DB  ID          Description
Title: US-09-701-626A-85
Perfect score: 20
Sequence: 1 ccgagtgagcgaagcgagcg 20

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 2947324 seqs, 2269024515 residues

Total number of hits satisfying chosen parameters: 5894648

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : Published_Applications_NA:*

1: /cgn2_6/ptodata/1/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/1/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/1/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/1/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/1/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/1/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/1/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/1/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/1/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/1/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/1/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/1/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/1/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/1/pubpna/US10C_PUBCOMB.seq:*
17: /cgn2_6/ptodata/1/pubpna/US10_NEW_PUB.seq:*
18: /cgn2_6/ptodata/1/pubpna/US60_NEW_PUB.seq:*
19: /cgn2_6/ptodata/1/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
Sequence 1


0 0 O 0 , Ap 




z y 


T C 

lb 


. z 


1 b . 


A 

u 


C A Q 

b4 y 


y 


TTC 


A O 

u y - 


ob / - 


"7 A 1 O Q "7 O v 

/oi-zy / Z 


Sequence 


O Q H O 7\ -r^ 

Zy / Z t Ap 


c 


*3 A 
J 0 


1b 


. Z 


/ b . 


U 


c / 1 

b4 1 


T A 
1U 


TTO 

Ub 


A Q 

- u y 


O A *3 

- y o 6 


T Q A £L £L 

- iy o - b b 


Sequence 


b b , App 1 




O 1 


lb 


. Z 


1 b . 


u 


s~ Q /~ 

&yb 


y 


TTO 

Ub - 


A Q 

uy - 


■7 "3 Q 

/jo- 


ez "s el *3 "3 c *7 


Sequence 


"3 *3 C T Ai^i 

i j b / / Ap 


c 


*3 O 
JZ 


T C 

lb 


. Z 


1 b . 


U 


*7 O "3 


lb 


TTO 

Ub 


1 A 

-IU 


- lbo 


nri -3 A A C 

- /ol--500b 


Sequence 


"3 A A C A -t-n 

0 0 0 b , Ap 


c 


O O 


lb 


. Z 


1 b . 


U 


o o o 
oZZ 




TTO 

Ub 


-10 


A O T 

- 0Z / 


-boz-ljoo4j 


Sequence 


1 *3 O C A "3 


c 


"3 /t 


1 c 

lb 


. Z 


1 b . 


f\ 
0 


o o o 
OZZ 


lo 


TTO 

Ub 


T A 

-10 


A O T 

- Oz 7 


-bjz-lJoo4j 


Sequence 


13 8643 , 


c 


1 c 
J b 


lb 


. z 


1 b . 


A 

u 


Q A Ck 

o4 y 


13 


TTO 

Ub 


-10 


TOO 

- z OZ 


T*7*7A "50QAC 

- 1Z ZA-ZZ a 0b 


Sequence 


O O O A C A 

ZZ o0b , A 


c 


0 O 


ID 


9 
. Z 


/ b . 


A 
U 


o b4 


1j 


TTO 
Ub 


- 1 A 
1 U 


- z o z 


_1 TTA.I I^AA 
- 1ZZH.-1JJ uu 


bequence 


1 "X 0 A a a 
loo 00 , A 




1 *7 


1 c 

lb 


. z 


/ b . 


A 
U 


Q H C\ 

o / U 


lo 


TTC 

Ub 


"1 A 

-10 


ICQ 

- j by 


ylQO / CC07 

-4 y j - 4 bb o / 


Sequence 


^ a C O *7 A 

4 bb 0 / , A 




J O 


"1 c 

lb 


. Z 


*7 C 


A 

u 


o /z 


1 J 


TTO 

Ub 


1 A 

-10 


A O C 

-4 Z b 


11/1 OfirtCO 

- 114 - z b Obo 


Sequence 


O *T A c O A 

z b Obo , A 






X D 


9 
• ^ 


/ D • 








TTC 
\j o 




-AO 
*± £t D 


ill ^DulJ 


Q & m 1 0 t*i 0 0 


9 £ A 1 C A 


c 40    15.2    76.0    975     13  US-10-424-599-80493   Sequence 80493, A
c 41    15.2    76.0    1164    16  US-10-369-493-32142   Sequence 32142, A
  42    15.2    76.0    1351    9   US-09-964-899-30      Sequence 30, Appl
c 43    15.2    76.0    1422    9   US-09-738-626-3356    Sequence 3356, Ap
  44    15.2    76.0    1434    15  US-10-156-761-7349    Sequence 7349, Ap
c 45    15.2    76.0    1601    16  US-10-411-010-16      Sequence 16, Appl



Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 27513289 seqs, 14931090276 residues

Total number of hits satisfying chosen parameters: 55026578

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries

Database : EST:*

1: em_estba:*
2: em_esthum:*
3: em_estin:*
4: em_estmu:*
5: em_estov:*
6: em_estpl:*
7: em_estro:*
8: em_htc:*
9: gb_est1:*
10: gb_est2:*
11: gb_htc:*
12: gb_est3:*
13: gb_est4:*
14: gb_est5:*
15: em_estfun:*
16: em_estom:*
17: em_gss_hum:*
18: em_gss_inv:*
19: em_gss_pln:*
20: em_gss_vrt:*
21: em_gss_fun:*
22: em_gss_mam:*
23: em_gss_mus:*
24: em_gss_pro:*
25: em_gss_rod:*
26: em_gss_phg:*
27: em_gss_vrl:*
28: gb_gss1:*
29: gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
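
The "% Query Match" column in the summaries appears to be the hit score expressed as a percentage of the perfect score. This relationship is inferred from the listed rows (for example, a score of 19 against a perfect score of 20 is listed as 95.0) and is not stated by the report itself. A minimal sketch under that assumption, with a hypothetical helper name:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Return the score as a percentage of the perfect score, to one decimal.

    Assumption: "% Query Match" = 100 * score / perfect score. This is
    inferred from the summary rows, not taken from GenCore documentation.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


# Scores taken from summary rows for the 20-residue query (perfect score 20).
for score in (19, 18.4, 17.4, 15.2):
    print(score, query_match_percent(score, 20))
```

With IDENTITY_NUC scoring, the perfect score in each run equals the query length, so a 100% query match corresponds to every query residue matching.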



SUMMARIES

                 %
Result          Query
No.     Score   Match   Length  DB  ID          Description

  1     19      95.0    511     28  AZ170008    AZ170008 SP_0115_A
  2     18.4    92.0    1459    12  BG745247    BG745247 602723418
c 3     17.4    87.0    394     28  AZ196356    AZ196356 SP_1031_B





[Results 4 to 38: the result number, score, match, length and DB columns are illegible in the scanned copy. IDs and descriptions that remain legible, in listing order:]

CC982757   ZUAAW63TH
BH127648   G-1123 Ma
BZ701921   PUCEJ37TD
CG367083   OG0FA40TV
BF693932   602082508
BJ189780   BJ189780
AL965152   AL965152
BQ162383   WHE0436_F
BJ203078   BJ203078
AZ151255   SP_0049_A
AL228519   Tetraodon
CA109349   SCSGHR107
CC931933   ZMMBBC054
CC838316   ZMMBBC049
CA704913   wdk1c.pk0
AQ852734   LMAJFV1_1
CF243500   3530_1_21
BH293647   CH230-30M
AG166602   Pan trogl
CG025543   ZMMBBC056
CC461497   ZMMBBC035
CC814629   ZMMBBc051


c   39    16.8   84.0     752  28  CC460796   CC460796 ZMMBBC036
c   40    16.8   84.0     763  29  CG025807   CG025807 ZMMBBC056
    41    16.8   84.0     771  14  CF243846   CF243846 3530_1_24
    42    16.8   84.0     780  29  CG900303   CG900303 ZMMBBb050
c   43    16.8   84.0     852  29  CG390408   CG390408 ZMMBBC057
c   44    16.8   84.0     857  29  CG388577   CG388577 ZMMBBC056
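
The summary rows above follow a fixed column order (optional "c" flag, result number, score, % query match, length, DB, ID, description). As a minimal sketch, assuming each row has been restored to a single line and that the leading "c" marks a hit on the complementary strand, the rows can be parsed and filtered by a match window like the "Minimum Match" and "Maximum Match" post-processing settings. The names `SummaryRow`, `parse_summary_row` and `within_match_window` are hypothetical and are not part of GenCore.

```python
import re
from typing import NamedTuple, Optional


class SummaryRow(NamedTuple):
    complement: bool      # True when the row carries the leading "c" flag (assumed meaning)
    result_no: int
    score: float
    query_match: float    # the "% Query Match" column
    length: int
    db: int
    seq_id: str
    description: str


# Optional "c", then five numeric columns, then the ID and a free-text description.
_ROW = re.compile(
    r"^\s*(?P<c>c\s+)?(?P<no>\d+)\s+(?P<score>\d+(?:\.\d+)?)\s+"
    r"(?P<match>\d+(?:\.\d+)?)\s+(?P<length>\d+)\s+(?P<db>\d+)\s+"
    r"(?P<id>\S+)\s*(?P<desc>.*)$"
)


def parse_summary_row(line: str) -> Optional[SummaryRow]:
    """Parse one summary line; return None for lines that are not result rows."""
    m = _ROW.match(line)
    if m is None:
        return None
    return SummaryRow(
        complement=m.group("c") is not None,
        result_no=int(m.group("no")),
        score=float(m.group("score")),
        query_match=float(m.group("match")),
        length=int(m.group("length")),
        db=int(m.group("db")),
        seq_id=m.group("id"),
        description=m.group("desc").strip(),
    )


def within_match_window(row: SummaryRow, min_match: float, max_match: float) -> bool:
    """Keep a row only if its % query match lies inside the window (inclusive)."""
    return min_match <= row.query_match <= max_match


row = parse_summary_row("c   39    16.8   84.0     752  28  CC460796   CC460796 ZMMBBC036")
print(row)
```

A line such as "No matches found" does not start with a result number, so the parser returns `None` for it rather than raising.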



GenCore version 5.1.6

Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: October 22, 2004, 06:27:10 ; Search time 631.995 Seconds
        (without alignments)
        2469.262 Million cell updates/sec

Title: US-09-701-626A-86
Perfect score: 33
Sequence: 1 aaactcgagggtcccgagtgagcgaagcgagcg 33

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 
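
The header above names an "sw model" with the IDENTITY_NUC scoring table, Gapop 10.0 and Gapext 1.0. As a rough illustration only, here is a minimal Smith-Waterman local alignment score with affine gaps. It is not GenCore's implementation. The match score of 1 is inferred from the perfect score equalling the query length; the mismatch score and the convention that a gap of length k costs Gapop + (k - 1) * Gapext are assumptions.

```python
def smith_waterman_score(query, target, match=1.0, mismatch=-0.6,
                         gap_open=10.0, gap_ext=1.0):
    """Best local alignment score (Gotoh-style affine gaps), score only.

    match=1.0 mirrors "Perfect score" = query length in the report.
    mismatch=-0.6 and the gap-cost convention are illustrative assumptions.
    """
    q, t = query.upper(), target.upper()
    neg_inf = float("-inf")
    n = len(t)
    h_prev = [0.0] * (n + 1)      # H[i-1][*]
    f = [neg_inf] * (n + 1)       # F[i-1][*]: best score ending in a vertical gap
    best = 0.0
    for i in range(1, len(q) + 1):
        h_cur = [0.0] * (n + 1)
        e = neg_inf               # E[i][j-1]: best score ending in a horizontal gap
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f[j] = max(h_prev[j] - gap_open, f[j] - gap_ext)
            s = match if q[i - 1] == t[j - 1] else mismatch
            h = max(0.0, h_prev[j - 1] + s, e, f[j])
            h_cur[j] = h
            if h > best:
                best = h
        h_prev = h_cur
    return best


query = "aaactcgagggtcccgagtgagcgaagcgagcg"
# An identical database sequence reaches the perfect score (the query length).
print(smith_waterman_score(query, query))
```

With "Minimum Match 100%" in the post-processing settings, only database sequences that reach the perfect score would be listed, which is why these searches return either one hit or none.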



Database : GenEmbl:*
 1: gb_ba:*
 2: gb_htg:*
 3: gb_in:*
 4: gb_om:*
 5: gb_ov:*
 6: gb_pat:*
 7: gb_ph:*
 8: gb_pl:*
 9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    33    100.0      33   6  BD211362   BD211362 Method of



Title: US-09-701-626A-86
Perfect score: 33
Sequence: 1 aaactcgagggtcccgagtgagcgaagcgagcg 33

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 4134886 seqs, 2624710521 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database : N_Geneseq_23Sep04:*
 1: geneseqn1980s:*
 2: geneseqn1990s:*
 3: geneseqn2000s:*
 4: geneseqn2001as:*
 5: geneseqn2001bs:*
 6: geneseqn2002as:*
 7: geneseqn2002bs:*
 8: geneseqn2003as:*
 9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    33    100.0      33   3  AAZ44975   Aaz44975 P. alcali



Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 22, 2004, 06:55:41 ; Search time 32.3757 Seconds 

(without alignments) 
724.496 Million cell updates/sec 

Title: US-09-701-626A-86 
Perfect score: 33 

Sequence: 1 aaactcgagggtcccgagtgagcgaagcgagcg 33 



Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : Issued_Patents_NA:*
1: /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

No matches found

Search completed: October 22, 2004, 13:45:50
Job time : 32.3757 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:59:51 ; Search time 132.624 Seconds
        (without alignments)
        1274.950 Million cell updates/sec

Title: US-09-701-626A-86
Perfect score: 33
Sequence: 1 aaactcgagggtcccgagtgagcgaagcgagcg 33

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters: 0

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database : Published_Applications_NA:*
 1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
 2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
 3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
 4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
 5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
 6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
 7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
 8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
 9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
19: /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
20: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
21: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 14:10:43 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: October 22, 2004, 06:27:10 ; Search time 670.297 Seconds
        (without alignments)
        2469.262 Million cell updates/sec

Title: US-09-701-626A-91
Perfect score: 35
Sequence: 1 tgctctagacggccgcccgttagcttaatcgttag 35

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters:

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database : GenEmbl:*
 1: gb_ba:*
 2: gb_htg:*
 3: gb_in:*
 4: gb_om:*
 5: gb_ov:*
 6: gb_pat:*
 7: gb_ph:*
 8: gb_pl:*
 9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    35    100.0      35   6  BD211367   BD211367 Method of



ALIGNMENTS 



RESULT 1
BD211367
LOCUS       BD211367    35 bp    DNA    linear    PAT 17-JUL-2003
DEFINITION  Method of finding restriction enzyme.
ACCESSION   BD211367
VERSION     BD211367.1  GI:33021137
KEYWORDS    JP 2002517260-A/91.
SOURCE      unidentified
  ORGANISM  unidentified
            unclassified.
REFERENCE   1 (bases 1 to 35)
  AUTHORS   Raleigh,E.A., Vaisvila,R. and Morgan,R.D.
  TITLE     Method of finding restriction enzyme
  JOURNAL   Patent: JP 2002517260-A 91 18-JUN-2002;
            NEW ENGLAND BIOLABS INC
COMMENT     OS Unknown
            PN JP 2002517260-A/91
            PD 18-JUN-2002
            PF 11-JUN-1999 JP 2000553622
            PR 12-JUN-1998 US 60/089086, 12-JUN-1998 US 60/089101
            PI ELISABETH A RALEIGH, ROMUALDAS VAISVILA, RICHARD D MORGAN
            PC C12Q1/68,C12N15/09,C12N15/00
            CC Description of Unknown Organism: Synthetic Oligonucleotide
            CC based on
            CC Pseudomonas Alcaligenes NEB#585 (ATCC 55044)
            FH Key Location/Qualifiers
            FT source 1..35
            FT /organism='Unknown'.
FEATURES    Location/Qualifiers
  source    1..35
            /organism="unidentified"
            /mol_type="genomic DNA"
            /db_xref="taxon:32644"
ORIGIN

Query Match 100.0%; Score 35; DB 6; Length 35;
Best Local Similarity 100.0%; Pred. No. 2.2e-05;
Matches 35; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 TGCTCTAGACGGCCGCCCGTTAGCTTAATCGTTAG 35
     |||||||||||||||||||||||||||||||||||
Db 1 TGCTCTAGACGGCCGCCCGTTAGCTTAATCGTTAG 35



Search completed: October 22, 2004, 09:26:36 
Job time : 671.297 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 22, 2004, 06:02:20 ; Search time 161.757 Seconds 



(without alignments) 

1135.840 Million cell updates/sec 



Title: US-09-701-626A-91 
Perfect score: 35 

Sequence: 1 tgctctagacggccgcccgttagcttaatcgttag 35 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0

Searched: 4134886 seqs, 2624710521 residues 

Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 
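
The "Million cell updates/sec" figure in the header above appears to follow from the other reported numbers. As a worked check, assuming one dynamic-programming cell per query residue per database residue and a factor of 2 for searching both strands (an inference from the numbers, not something the report states), the rate can be recomputed. The function name is hypothetical.

```python
def million_cell_updates_per_sec(query_len, db_residues, search_seconds, strands=2):
    """Recompute the reported throughput from the header values.

    strands=2 is an assumption: it is the factor that makes the figures
    in this report agree with each other.
    """
    cells = strands * query_len * db_residues
    return cells / search_seconds / 1e6


# Values from the header above: 35-residue query, 2624710521 residues,
# search time 161.757 seconds; the report gives 1135.840.
rate = million_cell_updates_per_sec(35, 2_624_710_521, 161.757)
print(f"{rate:.3f} million cell updates/sec")
```

The recomputed value agrees with the reported 1135.840 to within rounding of the search time. The same relation reproduces the 724.496 reported for the Issued_Patents_NA run of this query (824507 seqs, 355394441 residues, 34.3378 seconds).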



Database : N_Geneseq_23Sep04:*
 1: geneseqn1980s:*
 2: geneseqn1990s:*
 3: geneseqn2000s:*
 4: geneseqn2001as:*
 5: geneseqn2001bs:*
 6: geneseqn2002as:*
 7: geneseqn2002bs:*
 8: geneseqn2003as:*
 9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    35    100.0      35   3  AAZ44980   Aaz44980 P. alcali



ALIGNMENTS 



RESULT 1 
AAZ44980 

ID AAZ44980 standard; DNA; 35 BP. 
XX 

AC AAZ44980; 
XX 

DT 16-MAY-2000 (first entry) 



DE P. alcaligenes repeat (PAR) element DNA #87. 
XX 

KW Diversity-selected gene; restriction enzyme; adhesin; toxin; 

KW detoxifying enzyme; repeat element; PAR; ss. 

XX 

OS Pseudomonas alcaligenes. 
XX 

PN WO9964632-A1.
XX

PD 16-DEC-1999.
XX

PF 11-JUN-1999; 99WO-US013295.
XX

PR 12-JUN-1998; 98US-0089086P.

PR 12-JUN-1998; 98US-0089101P.
XX 

PA (NEWE ) NEW ENGLAND BIOLABS INC. 
XX 

PI Raleigh EA, Vaisvila R, Morgan RD; 
XX 

DR WPI; 2000-116558/10. 
XX 

PT Cloning intact genes used to isolate genes for restriction enzymes. 
XX 

PS Claim 10; Page 62; 97pp; English. 
XX 

CC This invention describes a novel method for cloning intact, diversity-
CC selected genes (I) from within gene cassettes (GC) which comprises
CC identifying DNA repeats that flank GC, hybridizing oligonucleotides (ON)
CC to these repeats and amplification to produce DNA fragments containing
CC (I), ligating these fragments into a vector and transforming cells with
CC the vector. This method is used to clone a wide variety of prokaryotic
CC genes that provide a selective advantage under particular conditions,
CC particularly those that encode restriction enzymes (used as reagents in
CC molecular biology); adhesins (for use in coating or for targeting
CC molecules or organisms to particular sites, e.g. for competitive
CC exclusion of a selected pathogen); detoxifying enzymes; toxins that
CC interact with a host, e.g. for synthesis of inhibitors or antagonists of
CC the toxin, or in vaccination, or a modification methyltransferase. Intact
CC genes can be cloned directly with a high probability that the orientation
CC of expression is known in advance and low probability of association with
CC extraneous, possibly toxic, genes. AAZ44894-Z44980 represent the
CC Pseudomonas alcaligenes repeat (PAR) elements described in the method of
CC the invention

XX 

SQ Sequence 35 BP; 6 A; 10 C; 9 G; 10 T; 0 U; 0 Other; 

Query Match 100.0%; Score 35; DB 3; Length 35; 

Best Local Similarity 100.0%; Pred. No. 2.1e-06; 

Matches 35; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 TGCTCTAGACGGCCGCCCGTTAGCTTAATCGTTAG 35
     |||||||||||||||||||||||||||||||||||
Db 1 TGCTCTAGACGGCCGCCCGTTAGCTTAATCGTTAG 35
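
The statistics printed with the alignment above can be recomputed from the two aligned strings. This is a minimal sketch under two assumptions that the report does not spell out: "Query Match" is the score divided by the perfect score, and "Best Local Similarity" is the number of matching columns divided by the alignment length. The function name `alignment_stats` and the use of "-" as the gap character are illustrative choices.

```python
def alignment_stats(qy, db, score, perfect_score):
    """Summarise a pairwise alignment given as two equal-length strings.

    Gaps are written as '-' (an assumption for this sketch). The two
    percentage definitions are assumed, as noted in the text above.
    """
    if len(qy) != len(db):
        raise ValueError("aligned strings must have equal length")
    matches = mismatches = indels = 0
    for a, b in zip(qy.upper(), db.upper()):
        if a == "-" or b == "-":
            indels += 1
        elif a == b:
            matches += 1
        else:
            mismatches += 1
    return {
        "query_match_pct": 100.0 * score / perfect_score,
        "best_local_similarity_pct": 100.0 * matches / len(qy) if qy else 0.0,
        "matches": matches,
        "mismatches": mismatches,
        "indels": indels,
    }


seq = "TGCTCTAGACGGCCGCCCGTTAGCTTAATCGTTAG"
print(alignment_stats(seq, seq, score=35, perfect_score=35))
```

The first definition is consistent with the summary tables elsewhere in this report, where a score of 16.8 is listed with a query match of 84.0, which implies a perfect score of 20 for that query.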



Search completed: October 22, 2004, 07:28:22 
Job time : 161.757 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:55:41 ; Search time 34.3378 Seconds 

(without alignments) 
724.496 Million cell updates/sec 

Title: US-09-701-626A-91 
Perfect score: 35 

Sequence: 1 tgctctagacggccgcccgttagcttaatcgttag 35 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 824507 seqs, 355394441 residues 



Total number of hits satisfying chosen parameters : 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : Issued_Patents_NA:*
1: /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:45:50 
Job time : 34.3378 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd.

OM nucleic - nucleic search, using sw model

Run on: October 22, 2004, 06:59:51 ; Search time 140.662 Seconds
        (without alignments)
        1274.950 Million cell updates/sec

Title: US-09-701-626A-91
Perfect score: 35
Sequence: 1 tgctctagacggccgcccgttagcttaatcgttag 35

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters: 0

Minimum DB seq length: 0

Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database : Published_Applications_NA:*
 1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
 2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
 3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
 4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
 5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
 6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
 7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
 8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
 9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
19: /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
20: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
21: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

No matches found

Search completed: October 22, 2004, 14:10:44
Job time : 140.662 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:54:36 ; Search time 1435.85 Seconds 

(without alignments) 
888.247 Million cell updates/sec 

Title: US-09-701-626A-91 
Perfect score: 35 

Sequence: 1 tgctctagacggccgcccgttagcttaatcgttag 35

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 32822875 seqs, 18219865908 residues



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : EST:*
1: gb_est1:*
2: gb_est2:*
3: gb_htc:*
4: gb_est3:*
5: gb_est4:*
6: gb_est5:*
7: gb_est6:*
8: gb_gss1:*
9: gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:39:40 
Job time : 1435.85 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:27:10 ; Search time 593.692 Seconds
        (without alignments)
        2469.262 Million cell updates/sec

Title: US-09-701-626A-90
Perfect score: 31
Sequence: 1 tgctctagacggccccttaaccaaacgttag 31

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : GenEmbl:*
 1: gb_ba:*
 2: gb_htg:*
 3: gb_in:*
 4: gb_om:*
 5: gb_ov:*
 6: gb_pat:*
 7: gb_ph:*
 8: gb_pl:*
 9: gb_pr:*
10: gb_ro:*
11: gb_sts:*
12: gb_sy:*
13: gb_un:*
14: gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    31    100.0      31   6  BD211366   BD211366 Method of



ALIGNMENTS 



RESULT 1
BD211366
LOCUS       BD211366    31 bp    DNA    linear    PAT 17-JUL-2003
DEFINITION  Method of finding restriction enzyme.
ACCESSION   BD211366
VERSION     BD211366.1  GI:33021136
KEYWORDS    JP 2002517260-A/90.
SOURCE      unidentified
  ORGANISM  unidentified
            unclassified.
REFERENCE   1 (bases 1 to 31)
  AUTHORS   Raleigh,E.A., Vaisvila,R. and Morgan,R.D.
  TITLE     Method of finding restriction enzyme
  JOURNAL   Patent: JP 2002517260-A 90 18-JUN-2002;
            NEW ENGLAND BIOLABS INC
COMMENT     OS Unknown
            PN JP 2002517260-A/90
            PD 18-JUN-2002
            PF 11-JUN-1999 JP 2000553622
            PR 12-JUN-1998 US 60/089086, 12-JUN-1998 US 60/089101
            PI ELISABETH A RALEIGH, ROMUALDAS VAISVILA, RICHARD D MORGAN
            PC C12Q1/68,C12N15/09,C12N15/00
            CC Description of Unknown Organism: Synthetic Oligonucleotide
            CC based on
            CC Pseudomonas Alcaligenes NEB#585 (ATCC 55044)
            FH Key Location/Qualifiers
            FT source 1..31
            FT /organism='Unknown'.
FEATURES    Location/Qualifiers
  source    1..31
            /organism="unidentified"
            /mol_type="genomic DNA"
            /db_xref="taxon:32644"
ORIGIN

Query Match 100.0%; Score 31; DB 6; Length 31;
Best Local Similarity 100.0%; Pred. No. 0.00028;
Matches 31; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 TGCTCTAGACGGCCCCTTAACCAAACGTTAG 31
     |||||||||||||||||||||||||||||||
Db 1 TGCTCTAGACGGCCCCTTAACCAAACGTTAG 31



Search completed: October 22, 2004, 09:26:35 
Job time : 593.692 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 

OM nucleic - nucleic search, using sw model 

Run on: October 22, 2004, 06:02:20 ; Search time 143.27 Seconds 



(without alignments) 

1135.840 Million cell updates/sec 



Title: US-09-701-626A-90
Perfect score: 31
Sequence: 1 tgctctagacggccccttaaccaaacgttag 31

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 4134886 seqs, 2624710521 residues



Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database : N_Geneseq_23Sep04:*
 1: geneseqn1980s:*
 2: geneseqn1990s:*
 3: geneseqn2000s:*
 4: geneseqn2001as:*
 5: geneseqn2001bs:*
 6: geneseqn2002as:*
 7: geneseqn2002bs:*
 8: geneseqn2003as:*
 9: geneseqn2003bs:*
10: geneseqn2003cs:*
11: geneseqn2003ds:*
12: geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES

                  %
Result          Query
   No.   Score  Match  Length  DB  ID         Description

     1    31    100.0      31   3  AAZ44979   Aaz44979 P. alcali



ALIGNMENTS 



RESULT 1 
AAZ44979 

ID AAZ44979 standard; DNA; 31 BP.
XX 

AC AAZ44979; 
XX 

DT 16-MAY-2000 (first entry) 



DE P. alcaligenes repeat (PAR) element DNA #86. 
XX 

KW Diversity-selected gene; restriction enzyme; adhesin; toxin; 

KW detoxifying enzyme; repeat element; PAR; ss. 

XX 

OS Pseudomonas alcaligenes. 
XX 

PN WO9964632-A1.
XX

PD 16-DEC-1999.
XX

PF 11-JUN-1999; 99WO-US013295.
XX

PR 12-JUN-1998; 98US-0089086P.

PR 12-JUN-1998; 98US-0089101P.
XX

PA (NEWE ) NEW ENGLAND BIOLABS INC.
XX

PI Raleigh EA, Vaisvila R, Morgan RD;
XX 

DR WPI; 2000-116558/10. 
XX 

PT Cloning intact genes used to isolate genes for restriction enzymes. 
XX 

PS Claim 10; Page 62; 97pp; English. 
XX 

CC This invention describes a novel method for cloning intact, diversity-
CC selected genes (I) from within gene cassettes (GC) which comprises
CC identifying DNA repeats that flank GC, hybridizing oligonucleotides (ON)
CC to these repeats and amplification to produce DNA fragments containing
CC (I), ligating these fragments into a vector and transforming cells with
CC the vector. This method is used to clone a wide variety of prokaryotic
CC genes that provide a selective advantage under particular conditions,
CC particularly those that encode restriction enzymes (used as reagents in
CC molecular biology); adhesins (for use in coating or for targeting
CC molecules or organisms to particular sites, e.g. for competitive
CC exclusion of a selected pathogen); detoxifying enzymes; toxins that
CC interact with a host, e.g. for synthesis of inhibitors or antagonists of
CC the toxin, or in vaccination, or a modification methyltransferase. Intact
CC genes can be cloned directly with a high probability that the orientation
CC of expression is known in advance and low probability of association with
CC extraneous, possibly toxic, genes. AAZ44894-Z44980 represent the
CC Pseudomonas alcaligenes repeat (PAR) elements described in the method of
CC the invention

XX 

SQ Sequence 31 BP; 8 A; 10 C; 6 G; 7 T; 0 U; 0 Other; 



Query Match 100.0%; Score 31; DB 3; Length 31;
Best Local Similarity 100.0%; Pred. No. 0.00012;
Matches 31; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy 1 TGCTCTAGACGGCCCCTTAACCAAACGTTAG 31
     |||||||||||||||||||||||||||||||
Db 1 TGCTCTAGACGGCCCCTTAACCAAACGTTAG 31



Search completed: October 22, 2004, 07:28:22 
Job time : 143.27 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: October 22, 2004, 06:55:41 ; Search time 30.4135 Seconds
        (without alignments)
        724.496 Million cell updates/sec

Title: US-09-701-626A-90
Perfect score: 31
Sequence: 1 tgctctagacggccccttaaccaaacgttag 31

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database : Issued_Patents_NA:*
1: /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
2: /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
3: /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
4: /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
5: /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
6: /cgn2_6/ptodata/1/ina/backfiles1.seq:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



No matches found 



Search completed: October 22, 2004, 13:45:50 
Job time : 30.4135 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:59:51 ; Search time 124.586 Seconds
        (without alignments)
        1274.950 Million cell updates/sec

Title: US-09-701-626A-90
Perfect score: 31
Sequence: 1 tgctctagacggccccttaaccaaacgttag 31

Scoring table: IDENTITY_NUC
               Gapop 10.0 , Gapext 1.0

Searched: 3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters: 0



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : Published_Applications_NA:*
 1: /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
 2: /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
 3: /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
 4: /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
 5: /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
 6: /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
 7: /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
 8: /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
 9: /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14: /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15: /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
16: /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
17: /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
18: /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
19: /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
20: /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
21: /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 14:10:44 
Job time : 124.586 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:54:36 ; Search time 1271.75 Seconds
                                             (without alignments)
                                             888.247 Million cell updates/sec

Title:          US-09-701-626A-90
Perfect score:  31
Sequence:       1 tgctctagacggccccttaaccaaacgttag 31

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 13:39:40 
Job time : 1271.75 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:27:10 ; Search time 631.995 Seconds
                                             (without alignments)
                                             2469.262 Million cell updates/sec

Title:          US-09-701-626A-89
Perfect score:  33
Sequence:       1 aaactcgagaccgcgcagcggggtttgcgtccg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues

Total number of hits satisfying chosen parameters:      1

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description
    1      33  100.0      33   6  BD211365   BD211365 Method of



ALIGNMENTS 



RESULT 1
BD211365
LOCUS       BD211365                  33 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  Method of finding restriction enzyme.
ACCESSION   BD211365
VERSION     BD211365.1  GI:33021135
KEYWORDS    JP 2002517260-A/89.
SOURCE      unidentified
  ORGANISM  unidentified
            unclassified.
REFERENCE   1  (bases 1 to 33)
  AUTHORS   Raleigh,E.A., Vaisvila,R. and Morgan,R.D.
  TITLE     Method of finding restriction enzyme
  JOURNAL   Patent: JP 2002517260-A 89 18-JUN-2002;
            NEW ENGLAND BIOLABS INC
COMMENT     OS   Unknown
            PN   JP 2002517260-A/89
            PD   18-JUN-2002
            PF   11-JUN-1999 JP 2000553622
            PR   12-JUN-1998 US 60/089086, 12-JUN-1998 US 60/089101
            PI   ELISABETH A RALEIGH, ROMUALDAS VAISVILA, RICHARD D MORGAN
            PC   C12Q1/68,C12N15/09,C12N15/00
            CC   Description of Unknown Organism: Synthetic Oligonucleotide
            CC   based on
            CC   Pseudomonas Alcaligenes NEB#585 (ATCC 55044)
            FH   Key             Location/Qualifiers
            FT   source          1..33
            FT                   /organism='Unknown'.
FEATURES             Location/Qualifiers
     source          1..33
                     /organism="unidentified"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:32644"
ORIGIN



  Query Match             100.0%;  Score 33;  DB 6;  Length 33;
  Best Local Similarity   100.0%;  Pred. No. 0.075;
  Matches   33;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 AAACTCGAGACCGCGCAGCGGGGTTTGCGTCCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGACCGCGCAGCGGGGTTTGCGTCCG 33



Search completed: October 22, 2004, 09:26:35 
Job time : 631.995 secs
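
As a rough illustration of what a search with the "sw model", the IDENTITY_NUC table and Gapop 10.0 / Gapext 1.0 computes, the sketch below scores a Smith-Waterman local alignment with affine gaps. It is not GenCore's implementation. The match score of 1 is inferred from the perfect score of 33 reported for the 33-base query; the mismatch score of 0 and the convention that the first gapped base costs Gapop and each further base costs Gapext are assumptions, and the name `sw_score` is hypothetical.

```python
# Minimal Smith-Waterman (affine gap) scoring sketch; assumptions are noted inline.
QUERY = "aaactcgagaccgcgcagcggggtttgcgtccg"  # US-09-701-626A-89, from the report


def sw_score(query, target, match=1.0, mismatch=0.0, gap_open=10.0, gap_ext=1.0):
    """Return the best local alignment score of query against target.

    match=1 is inferred from "Perfect score: 33" for a 33-base query.
    mismatch=0 and the gap cost convention are assumptions, not GenCore facts.
    """
    q, t = query.upper(), target.upper()
    n = len(t)
    neg_inf = float("-inf")
    h_prev = [0.0] * (n + 1)      # best scores for the previous query row
    f = [neg_inf] * (n + 1)       # best score ending in a gap along the query axis
    best = 0.0
    for i in range(1, len(q) + 1):
        h_cur = [0.0] * (n + 1)
        e = neg_inf               # best score ending in a gap along the target axis
        for j in range(1, n + 1):
            e = max(e - gap_ext, h_cur[j - 1] - gap_open)
            f[j] = max(f[j] - gap_ext, h_prev[j] - gap_open)
            diag = h_prev[j - 1] + (match if q[i - 1] == t[j - 1] else mismatch)
            h_cur[j] = max(0.0, diag, e, f[j])   # local alignment never drops below 0
            best = max(best, h_cur[j])
        h_prev = h_cur
    return best


if __name__ == "__main__":
    # A database entry identical to the query reaches the perfect score.
    print(sw_score(QUERY, QUERY))
```

Under these assumptions, a database sequence identical to the query scores 33, which corresponds to the 100% "Query Match" required by the post-processing filter.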

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:02:20 ; Search time 152.514 Seconds
                                             (without alignments)
                                             1135.840 Million cell updates/sec



Title: US-09-701-626A-89 
Perfect score: 33 

Sequence: 1 aaactcgagaccgcgcagcggggtttgcgtccg 33 

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 

Searched: 4134886 seqs, 2624710521 residues 

Total number of hits satisfying chosen parameters: 1

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
                10:  geneseqn2003cs:*
                11:  geneseqn2003ds:*
                12:  geneseqn2004s:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description
    1      33  100.0      33   3  AAZ44978   Aaz44978 P. alcali



ALIGNMENTS 



RESULT 1 
AAZ44978 

ID   AAZ44978 standard; DNA; 33 BP.
XX
AC   AAZ44978;
XX
DT   16-MAY-2000  (first entry)
XX
DE   P. alcaligenes repeat (PAR) element DNA #85.
XX
KW   Diversity-selected gene; restriction enzyme; adhesin; toxin;
KW   detoxifying enzyme; repeat element; PAR; ss.
XX
OS   Pseudomonas alcaligenes.
XX
PN   WO9964632-A1.
XX
PD   16-DEC-1999.
XX
PF   11-JUN-1999;   99WO-US013295.
XX
PR   12-JUN-1998;   98US-0089086P.
PR   12-JUN-1998;   98US-0089101P.
XX
PA   (NEWE ) NEW ENGLAND BIOLABS INC.
XX
PI   Raleigh EA,  Vaisvila R,  Morgan RD;
XX
DR   WPI; 2000-116558/10.
XX
PT   Cloning intact genes used to isolate genes for restriction enzymes.
XX
PS   Claim 10; Page 62; 97pp; English.
XX
CC   This invention describes a novel method for cloning intact, diversity-
CC   selected genes (I) from within gene cassettes (GC) which comprises
CC   identifying DNA repeats that flank GC, hybridizing oligonucleotides (ON)
CC   to these repeats and amplification to produce DNA fragments containing
CC   (I), ligating these fragments into a vector and transforming cells with
CC   the vector. This method is used to clone a wide variety of prokaryotic
CC   genes that provide a selective advantage under particular conditions,
CC   particularly those that encode restriction enzymes (used as reagents in
CC   molecular biology); adhesins (for use in coating or for targeting
CC   molecules or organisms to particular sites, e.g. for competitive
CC   exclusion of a selected pathogen); detoxifying enzymes; toxins that
CC   interact with a host, e.g. for synthesis of inhibitors or antagonists of
CC   the toxin, or in vaccination, or a modification methyltransferase. Intact
CC   genes can be cloned directly with a high probability that the orientation
CC   of expression is known in advance and low probability of association with
CC   extraneous, possibly toxic, genes. AAZ44894-Z44980 represent the
CC   Pseudomonas alcaligenes repeat (PAR) elements described in the method of
CC   the invention
XX
SQ   Sequence 33 BP; 6 A; 10 C; 12 G; 5 T; 0 U; 0 Other;



Query Match 100.0%; Score 33; DB 3; Length 33; 

Best Local Similarity 100.0%; Pred. No. 0.00034; 

Matches 33; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy          1 AAACTCGAGACCGCGCAGCGGGGTTTGCGTCCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGACCGCGCAGCGGGGTTTGCGTCCG 33



Search completed: October 22, 2004, 07:28:22 
Job time : 152.514 secs
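
The base counts on the SQ line of the Geneseq record (6 A; 10 C; 12 G; 5 T) can be cross-checked against the query string itself. The short sketch below does that in Python; the helper name `base_composition` is hypothetical and is not part of GenCore or Geneseq.

```python
from collections import Counter

# Query US-09-701-626A-89 as printed in the search header.
QUERY = "aaactcgagaccgcgcagcggggtttgcgtccg"


def base_composition(seq):
    """Count A, C, G, T, U and any other symbols, mirroring the SQ line fields."""
    counts = Counter(seq.upper())
    result = {base: counts.get(base, 0) for base in "ACGTU"}
    result["Other"] = len(seq) - sum(result.values())
    return result


if __name__ == "__main__":
    print(len(QUERY), base_composition(QUERY))
```

The counts agree with the SQ line of AAZ44978, which is consistent with the 33/33 match reported in the alignment.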

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:55:41 ; Search time 32.3757 Seconds
                                             (without alignments)
                                             724.496 Million cell updates/sec

Title:          US-09-701-626A-89
Perfect score:  33
Sequence:       1 aaactcgagaccgcgcagcggggtttgcgtccg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:45:50 
Job time : 32.3757 secs
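
The "Million cell updates/sec" figure in each header can be reproduced from the other header values. The sketch below assumes the rate is query length times database residues times 2, divided by the search time. The factor of 2 is an assumption (presumably both strands are searched); it is not stated in the report, but it reproduces the printed figures. The function name is hypothetical.

```python
def cell_updates_per_sec(query_len, db_residues, search_seconds, strands=2):
    """Estimate dynamic-programming cell updates per second.

    strands=2 is an assumption made to match the figures printed in the report.
    """
    return query_len * db_residues * strands / search_seconds


# Values from the Issued_Patents_NA search of US-09-701-626A-89.
rate_millions = cell_updates_per_sec(33, 355394441, 32.3757) / 1e6

if __name__ == "__main__":
    # Compare against the 724.496 Million cell updates/sec printed in the header.
    print(round(rate_millions, 3))
```

The same arithmetic applied to the other headers in this report (for example 33 bases against 2561960514 residues in 132.624 seconds) gives values close to the printed rates.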



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:59:51 ; Search time 132.624 Seconds
                                             (without alignments)
                                             1274.950 Million cell updates/sec

Title:          US-09-701-626A-89
Perfect score:  33
Sequence:       1 aaactcgagaccgcgcagcggggtttgcgtccg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters:      0



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                13:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                14:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                15:  /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
                16:  /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
                17:  /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
                18:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                19:  /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
                20:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                21:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 14:10:44 
Job time : 133.624 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:54:36 ; Search time 1353.8 Seconds
                                             (without alignments)
                                             888.247 Million cell updates/sec

Title:          US-09-701-626A-89
Perfect score:  33
Sequence:       1 aaactcgagaccgcgcagcggggtttgcgtccg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues



Total number of hits satisfying chosen parameters: 0 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:39:40 
Job time : 1353.8 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:27:10 ; Search time 631.995 Seconds
                                             (without alignments)
                                             2469.262 Million cell updates/sec

Title:          US-09-701-626A-88
Perfect score:  33
Sequence:       1 aaactcgaggctgtccagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4526729 seqs, 23644849745 residues



Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description
    1      33  100.0      33   6  BD211364   BD211364 Method of



ALIGNMENTS 



RESULT 1
BD211364
LOCUS       BD211364                  33 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  Method of finding restriction enzyme.
ACCESSION   BD211364
VERSION     BD211364.1  GI:33021134
KEYWORDS    JP 2002517260-A/88.
SOURCE      unidentified
  ORGANISM  unidentified
            unclassified.
REFERENCE   1  (bases 1 to 33)
  AUTHORS   Raleigh,E.A., Vaisvila,R. and Morgan,R.D.
  TITLE     Method of finding restriction enzyme
  JOURNAL   Patent: JP 2002517260-A 88 18-JUN-2002;
            NEW ENGLAND BIOLABS INC
COMMENT     OS   Unknown
            PN   JP 2002517260-A/88
            PD   18-JUN-2002
            PF   11-JUN-1999 JP 2000553622
            PR   12-JUN-1998 US 60/089086, 12-JUN-1998 US 60/089101
            PI   ELISABETH A RALEIGH, ROMUALDAS VAISVILA, RICHARD D MORGAN
            PC   C12Q1/68,C12N15/09,C12N15/00
            CC   Description of Unknown Organism: Synthetic Oligonucleotide
            CC   based on
            CC   Pseudomonas Alcaligenes NEB#585 (ATCC 55044)
            FH   Key             Location/Qualifiers
            FT   source          1..33
            FT                   /organism='Unknown'.
FEATURES             Location/Qualifiers
     source          1..33
                     /organism="unidentified"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:32644"
ORIGIN



  Query Match             100.0%;  Score 33;  DB 6;  Length 33;
  Best Local Similarity   100.0%;  Pred. No. 0.0084;
  Matches   33;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 AAACTCGAGGCTGTCCAGCGAGCGAAGCGAGCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGGCTGTCCAGCGAGCGAAGCGAGCG 33



Search completed: October 22, 2004, 09:26:35 
Job time : 631.995 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 22, 2004, 06:02:20 ; Search time 152.514 Seconds 



(without alignments) 

1135.840 Million cell updates/sec 



Title:          US-09-701-626A-88
Perfect score:  33
Sequence:       1 aaactcgaggctgtccagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 

Searched: 4134886 seqs, 2624710521 residues 

Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 

Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
                10:  geneseqn2003cs:*
                11:  geneseqn2003ds:*
                12:  geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description
    1      33  100.0      33   3  AAZ44977   Aaz44977 P. alcali



ALIGNMENTS 



RESULT 1 
AAZ44977 

ID   AAZ44977 standard; DNA; 33 BP.
XX
AC   AAZ44977;
XX
DT   16-MAY-2000  (first entry)
XX
DE   P. alcaligenes repeat (PAR) element DNA #84.
XX
KW   Diversity-selected gene; restriction enzyme; adhesin; toxin;
KW   detoxifying enzyme; repeat element; PAR; ss.
XX
OS   Pseudomonas alcaligenes.
XX
PN   WO9964632-A1.
XX
PD   16-DEC-1999.
XX
PF   11-JUN-1999;   99WO-US013295.
XX
PR   12-JUN-1998;   98US-0089086P.
PR   12-JUN-1998;   98US-0089101P.
XX
PA   (NEWE ) NEW ENGLAND BIOLABS INC.
XX
PI   Raleigh EA,  Vaisvila R,  Morgan RD;
XX
DR   WPI; 2000-116558/10.
XX
PT   Cloning intact genes used to isolate genes for restriction enzymes.
XX
PS   Claim 10; Page 62; 97pp; English.
XX
CC   This invention describes a novel method for cloning intact, diversity-
CC   selected genes (I) from within gene cassettes (GC) which comprises
CC   identifying DNA repeats that flank GC, hybridizing oligonucleotides (ON)
CC   to these repeats and amplification to produce DNA fragments containing
CC   (I), ligating these fragments into a vector and transforming cells with
CC   the vector. This method is used to clone a wide variety of prokaryotic
CC   genes that provide a selective advantage under particular conditions,
CC   particularly those that encode restriction enzymes (used as reagents in
CC   molecular biology); adhesins (for use in coating or for targeting
CC   molecules or organisms to particular sites, e.g. for competitive
CC   exclusion of a selected pathogen); detoxifying enzymes; toxins that
CC   interact with a host, e.g. for synthesis of inhibitors or antagonists of
CC   the toxin, or in vaccination, or a modification methyltransferase. Intact
CC   genes can be cloned directly with a high probability that the orientation
CC   of expression is known in advance and low probability of association with
CC   extraneous, possibly toxic, genes. AAZ44894-Z44980 represent the
CC   Pseudomonas alcaligenes repeat (PAR) elements described in the method of
CC   the invention
XX
SQ   Sequence 33 BP; 9 A; 9 C; 12 G; 3 T; 0 U; 0 Other;



Query Match 100.0%; Score 33; DB 3; Length 33; 

Best Local Similarity 100.0%; Pred. No. 0.0004; 

Matches 33; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy          1 AAACTCGAGGCTGTCCAGCGAGCGAAGCGAGCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGGCTGTCCAGCGAGCGAAGCGAGCG 33



Search completed: October 22, 2004, 07:28:22 
Job time : 153.514 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model

Run on:         October 22, 2004, 06:55:41 ; Search time 32.3757 Seconds
                                             (without alignments)
                                             724.496 Million cell updates/sec

Title:          US-09-701-626A-88
Perfect score:  33
Sequence:       1 aaactcgaggctgtccagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues

Total number of hits satisfying chosen parameters:      0

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 13:45:50 
Job time : 32.3757 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:59:51 ; Search time 132.624 Seconds
                                             (without alignments)
                                             1274.950 Million cell updates/sec

Title:          US-09-701-626A-88
Perfect score:  33
Sequence:       1 aaactcgaggctgtccagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       3407233 seqs, 2561960514 residues

Total number of hits satisfying chosen parameters:      0

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries

Database :      Published_Applications_NA:*
                1:  /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
                2:  /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
                3:  /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
                4:  /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
                5:  /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
                6:  /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
                7:  /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
                8:  /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
                9:  /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
                10:  /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
                11:  /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
                12:  /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
                13:  /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
                14:  /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
                15:  /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
                16:  /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
                17:  /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
                18:  /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
                19:  /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
                20:  /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
                21:  /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description



No matches found 



Search completed: October 22, 2004, 14:10:43 
Job time : 132.624 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:54:36 ; Search time 1353.8 Seconds
                                             (without alignments)
                                             888.247 Million cell updates/sec

Title:          US-09-701-626A-88
Perfect score:  33
Sequence:       1 aaactcgaggctgtccagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       32822875 seqs, 18219865908 residues

Total number of hits satisfying chosen parameters:      0

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      EST:*
                1:  gb_est1:*
                2:  gb_est2:*
                3:  gb_htc:*
                4:  gb_est3:*
                5:  gb_est4:*
                6:  gb_est5:*
                7:  gb_est6:*
                8:  gb_gss1:*
                9:  gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:39:40 
Job time : 1353.8 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 22, 2004, 06:27:10 ; Search time 631.995 Seconds 

(without alignments)
2469.262 Million cell updates/sec

Title: US-09-701-626A-87

Perfect score: 33

Sequence: 1 aaactcgagggtcccgagcgagcgaagcgagcg 33

Scoring table: IDENTITY_NUC

Gapop 10.0 , Gapext 1.0 

Searched: 4526729 seqs, 23644849745 residues 

Total number of hits satisfying chosen parameters: 1 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database :      GenEmbl:*
                1:  gb_ba:*
                2:  gb_htg:*
                3:  gb_in:*
                4:  gb_om:*
                5:  gb_ov:*
                6:  gb_pat:*
                7:  gb_ph:*
                8:  gb_pl:*
                9:  gb_pr:*
                10:  gb_ro:*
                11:  gb_sts:*
                12:  gb_sy:*
                13:  gb_un:*
                14:  gb_vi:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 33 100.0 33 6 BD211363 BD211363 Method of



ALIGNMENTS 



RESULT 1
BD211363
LOCUS       BD211363                  33 bp    DNA     linear   PAT 17-JUL-2003
DEFINITION  Method of finding restriction enzyme.
ACCESSION   BD211363
VERSION     BD211363.1  GI:33021133
KEYWORDS    JP 2002517260-A/87.
SOURCE      unidentified
  ORGANISM  unidentified
            unclassified.
REFERENCE   1  (bases 1 to 33)
  AUTHORS   Raleigh,E.A., Vaisvila,R. and Morgan,R.D.
  TITLE     Method of finding restriction enzyme
  JOURNAL   Patent: JP 2002517260-A 87 18-JUN-2002;
            NEW ENGLAND BIOLABS INC
COMMENT     OS   Unknown
            PN   JP 2002517260-A/87
            PD   18-JUN-2002
            PF   11-JUN-1999 JP 2000553622
            PR   12-JUN-1998 US 60/089086, 12-JUN-1998 US 60/089101
            PI   ELISABETH A RALEIGH, ROMUALDAS VAISVILA, RICHARD D MORGAN
            PC   C12Q1/68,C12N15/09,C12N15/00
            CC   Description of Unknown Organism: Synthetic Oligonucleotide
            CC   based on
            CC   Pseudomonas Alcaligenes NEB#585 (ATCC 55044)
            FH   Key             Location/Qualifiers
            FT   source          1..33
            FT                   /organism='Unknown'.
FEATURES             Location/Qualifiers
     source          1..33
                     /organism="unidentified"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:32644"
ORIGIN



  Query Match             100.0%;  Score 33;  DB 6;  Length 33;
  Best Local Similarity   100.0%;  Pred. No. 0.03;
  Matches   33;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy          1 AAACTCGAGGGTCCCGAGCGAGCGAAGCGAGCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGGGTCCCGAGCGAGCGAAGCGAGCG 33



Search completed: October 22, 2004, 09:26:35 
Job time : 632.995 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:02:20 ; Search time 152.514 Seconds
                                             (without alignments)
                                             1135.840 Million cell updates/sec

Title:          US-09-701-626A-87
Perfect score:  33
Sequence:       1 aaactcgagggtcccgagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       4134886 seqs, 2624710521 residues

Total number of hits satisfying chosen parameters:      1

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 100%
                 Maximum Match 100%
                 Listing first 1000 summaries



Database :      N_Geneseq_23Sep04:*
                1:  geneseqn1980s:*
                2:  geneseqn1990s:*
                3:  geneseqn2000s:*
                4:  geneseqn2001as:*
                5:  geneseqn2001bs:*
                6:  geneseqn2002as:*
                7:  geneseqn2002bs:*
                8:  geneseqn2003as:*
                9:  geneseqn2003bs:*
                10:  geneseqn2003cs:*
                11:  geneseqn2003ds:*
                12:  geneseqn2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result         Query
  No.   Score  Match  Length  DB  ID         Description
    1      33  100.0      33   3  AAZ44976   Aaz44976 P. alcali



ALIGNMENTS 



RESULT 1 
AAZ44976 

ID   AAZ44976 standard; DNA; 33 BP.
XX
AC   AAZ44976;
XX
DT   16-MAY-2000  (first entry)
XX
DE   P. alcaligenes repeat (PAR) element DNA #83.
XX
KW   Diversity-selected gene; restriction enzyme; adhesin; toxin;
KW   detoxifying enzyme; repeat element; PAR; ss.
XX
OS   Pseudomonas alcaligenes.
XX
PN   WO9964632-A1.
XX
PD   16-DEC-1999.
XX
PF   11-JUN-1999;   99WO-US013295.
XX
PR   12-JUN-1998;   98US-0089086P.
PR   12-JUN-1998;   98US-0089101P.
XX
PA   (NEWE ) NEW ENGLAND BIOLABS INC.
XX
PI   Raleigh EA,  Vaisvila R,  Morgan RD;
XX
DR   WPI; 2000-116558/10.
XX
PT   Cloning intact genes used to isolate genes for restriction enzymes.
XX
PS   Claim 10; Page 62; 97pp; English.
XX
CC   This invention describes a novel method for cloning intact, diversity-
CC   selected genes (I) from within gene cassettes (GC) which comprises
CC   identifying DNA repeats that flank GC, hybridizing oligonucleotides (ON)
CC   to these repeats and amplification to produce DNA fragments containing
CC   (I), ligating these fragments into a vector and transforming cells with
CC   the vector. This method is used to clone a wide variety of prokaryotic
CC   genes that provide a selective advantage under particular conditions,
CC   particularly those that encode restriction enzymes (used as reagents in
CC   molecular biology); adhesins (for use in coating or for targeting
CC   molecules or organisms to particular sites, e.g. for competitive
CC   exclusion of a selected pathogen); detoxifying enzymes; toxins that
CC   interact with a host, e.g. for synthesis of inhibitors or antagonists of
CC   the toxin, or in vaccination, or a modification methyltransferase. Intact
CC   genes can be cloned directly with a high probability that the orientation
CC   of expression is known in advance and low probability of association with
CC   extraneous, possibly toxic, genes. AAZ44894-Z44980 represent the
CC   Pseudomonas alcaligenes repeat (PAR) elements described in the method of
CC   the invention
XX
SQ   Sequence 33 BP; 9 A; 9 C; 13 G; 2 T; 0 U; 0 Other;

Query Match 100.0%; Score 33; DB 3; Length 33; 
Best Local Similarity 100.0%; Pred. No. 0.00067; 

Matches 33 ; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 



Qy          1 AAACTCGAGGGTCCCGAGCGAGCGAAGCGAGCG 33
              |||||||||||||||||||||||||||||||||
Db          1 AAACTCGAGGGTCCCGAGCGAGCGAAGCGAGCG 33



Search completed: October 22, 2004, 07:28:21 
Job time : 152.514 secs

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:55:41 ; Search time 32.3757 Seconds
                                             (without alignments)
                                             724.496 Million cell updates/sec

Title:          US-09-701-626A-87
Perfect score:  33
Sequence:       1 aaactcgagggtcccgagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0

Searched:       824507 seqs, 355394441 residues

Total number of hits satisfying chosen parameters:      0



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database :      Issued_Patents_NA:*
                1:  /cgn2_6/ptodata/1/ina/5A_COMB.seq:*
                2:  /cgn2_6/ptodata/1/ina/5B_COMB.seq:*
                3:  /cgn2_6/ptodata/1/ina/6A_COMB.seq:*
                4:  /cgn2_6/ptodata/1/ina/6B_COMB.seq:*
                5:  /cgn2_6/ptodata/1/ina/PCTUS_COMB.seq:*
                6:  /cgn2_6/ptodata/1/ina/backfiles1.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:45:50 
Job time : 32.3757 secs 
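
The report headers name a Smith-Waterman ("sw") model, an identity scoring table, and gap penalties Gapop 10.0 and Gapext 1.0. The following is only a minimal sketch of that kind of local alignment scoring, not the GenCore implementation. It assumes a match scores 1 (consistent with a perfect score of 33 for a 33-base query), and it further assumes a mismatch scores 0 and that a gap of length k costs Gapop + (k - 1) * Gapext; neither of those two conventions is stated in the report. The function names are hypothetical.

```python
def sw_score(query, target, match=1.0, mismatch=0.0,
             gap_open=10.0, gap_extend=1.0):
    """Best local alignment score (Smith-Waterman with affine gaps).

    Assumed conventions (not taken from the report): mismatch = 0 and
    a gap of length k costs gap_open + (k - 1) * gap_extend.
    """
    q, t = query.upper(), target.upper()
    neg_inf = float("-inf")
    h_prev = [0.0] * (len(t) + 1)      # best scores for the previous row
    f_prev = [neg_inf] * (len(t) + 1)  # best scores ending in a vertical gap
    best = 0.0
    for i in range(1, len(q) + 1):
        h_cur = [0.0] * (len(t) + 1)
        f_cur = [neg_inf] * (len(t) + 1)
        e = neg_inf                    # best score ending in a horizontal gap
        for j in range(1, len(t) + 1):
            e = max(e - gap_extend, h_cur[j - 1] - gap_open)
            f_cur[j] = max(f_prev[j] - gap_extend, h_prev[j] - gap_open)
            s = match if q[i - 1] == t[j - 1] else mismatch
            h_cur[j] = max(0.0, h_prev[j - 1] + s, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def query_match_percent(score, perfect_score):
    """Score as a percentage of the perfect (self-alignment) score."""
    return 100.0 * score / perfect_score


if __name__ == "__main__":
    seq = "aaactcgagggtcccgagcgagcgaagcgagcg"  # query sequence from the report
    score = sw_score(seq, seq)
    print(score, query_match_percent(score, len(seq)))
```

Under these assumptions, a post-processing filter of "Minimum Match 100%" keeps only hits whose score equals the perfect score, which is why a report can legitimately end with "No matches found".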



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on:         October 22, 2004, 06:59:51 ; Search time 132.624 Seconds
                (without alignments)
                1274.950 Million cell updates/sec

Title:          US-09-701-626A-87
Perfect score:  33
Sequence:       1 aaactcgagggtcccgagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0



Searched: 3407233 seqs, 2561960514 residues 

Total number of hits satisfying chosen parameters: 0 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : Published_Applications_NA: * 



1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB.seq:*
2 : /cgn2_6/ptodata/2/pubpna/PCT_NEW_PUB.seq:*
3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB.seq:*
4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB.seq:*
5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB.seq:*
6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB.seq:*
7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB.seq:*
8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB.seq:*
9 : /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB.seq:*
10 : /cgn2_6/ptodata/2/pubpna/US09B_PUBCOMB.seq:*
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB.seq:*
12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:*
13 : /cgn2_6/ptodata/2/pubpna/US10A_PUBCOMB.seq:*
14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq:*
15 : /cgn2_6/ptodata/2/pubpna/US10C_PUBCOMB.seq:*
16 : /cgn2_6/ptodata/2/pubpna/US10D_PUBCOMB.seq:*
17 : /cgn2_6/ptodata/2/pubpna/US10E_PUBCOMB.seq:*
18 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB.seq:*
19 : /cgn2_6/ptodata/2/pubpna/US11_NEW_PUB.seq:*
20 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB.seq:*
21 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB.seq:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 
Result Query 
No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 14:10:43 
Job time : 132.624 secs 

GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on:         October 22, 2004, 06:54:36 ; Search time 1353.8 Seconds
                (without alignments)
                888.247 Million cell updates/sec

Title:          US-09-701-626A-87
Perfect score:  33
Sequence:       1 aaactcgagggtcccgagcgagcgaagcgagcg 33

Scoring table:  IDENTITY_NUC
                Gapop 10.0 , Gapext 1.0



Searched: 32822875 seqs, 18219865908 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 100% 
Maximum Match 100% 
Listing first 1000 summaries 



Database : EST: * 



gb_est1:*
gb_est2:*
gb_htc:*
gb_est3:*
gb_est4:*
gb_est5:*
gb_est6:*
gb_gss1:*
gb_gss2:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 
Result Query 
No. Score Match Length DB ID Description 



No matches found 



Search completed: October 22, 2004, 13:39:40 
Job time : 1353.8 secs 



SCORE OVER LENGTH SEARCHES 



Attached is a score over length search. This search was developed to overcome a 
limitation of most standard search systems, which favor large sequences with high 
scores but lesser overall identity over smaller sequences with higher overall identity. 
This search is especially useful for relatively small nucleic acid or polypeptide target 
sequences (antisense, fragments, probes, primers, RNAi, epitopes, haptens, etc.) claimed 
functionally via a form of hybridization and/or identity language and having defined 
upper and lower polynucleotide and/or polypeptide length limits. 

The score over length search is performed by first running the query sequence using 
examiner-specified identity and polynucleotide or protein length limit parameters, and 
saving 65,000 hits and 0 alignments from each desired database. The resulting output is 
reformatted using a Microsoft Word macro and is imported into Excel. The summary 
table data are then sorted by the ratio of score of each hit sequence divided by its length 
and the accession numbers for all hits below the examiner's desired score over length 
parameters are deleted. The remaining accession numbers are used to pull the 
corresponding sequences from the databases into subdatabases enriched for good hits and 
the query sequence is re-run against these subdatabases to yield the final results. 
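
The sort-and-delete step described above can be sketched as follows. This is a rough illustration only, not the Word macro and Excel procedure actually used. The `Hit` record, the function name, and the sample accession labels are hypothetical, and the cutoff is assumed to be a fraction (for example 0.70) compared against score divided by hit length.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    accession: str  # hypothetical accession label for a summary-table row
    score: float    # alignment score reported for the hit
    length: int     # length of the hit (database) sequence


def filter_by_score_over_length(hits, cutoff, min_len=None, max_len=None):
    """Keep hits whose score/length ratio is at least `cutoff`,
    sorted from highest ratio to lowest.

    Optional min_len/max_len mimic examiner-specified length limits.
    """
    kept = []
    for hit in hits:
        if hit.length <= 0:
            continue  # a ratio is undefined for an empty sequence
        if min_len is not None and hit.length < min_len:
            continue
        if max_len is not None and hit.length > max_len:
            continue
        ratio = hit.score / hit.length
        if ratio >= cutoff:
            kept.append((ratio, hit))
    kept.sort(key=lambda pair: pair[0], reverse=True)
    return [hit for _, hit in kept]


if __name__ == "__main__":
    # Made-up rows for illustration; not taken from any search result.
    rows = [Hit("A", 30, 33), Hit("B", 40, 200), Hit("C", 10, 10)]
    for hit in filter_by_score_over_length(rows, cutoff=0.70):
        print(hit.accession, hit.score / hit.length)
```

In this sketch the long, high-scoring row "B" is dropped because its score is small relative to its length, which is the bias the score over length search is meant to correct. The surviving accession numbers would then be used to build the enriched subdatabases for the re-run.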

The score over length cutoff for this search is 70%. 



Examiner Please Note: This cover sheet should be included when submitting results to 
be scanned. 



